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Abstract 

This  thesis  develops  anificial  neural  network  wavefront  reconstructors  for  a  ground-based 
adaptive  optical  imaging  system.  The  incoming  object  wavefront  is  reflected  from  a  21  actuator 
a3ntinuous  faceplate  deformable  minor  onto  a  20  subapenure  Hanmann-Shack  wavefront  sensor. 

Recunent  and  feedforward  artificia.  neural  networks  are  developed  as  wavefront 
reconstmaors.  The  recunent  neural  network  studied  is  the  Hopfield  neural  network  and  the 
feedforward  neural  network  studied  is  the  single  layer  perceptron  aniflcial  neural  network.  The 
recunent  aniflcial  neural  network  input  features  are  the  wavefront  sensor  slope  outputs  and 
neighboring  actuator  feedback  commands.  The  feedforward  artiflcial  neural  network  input  features 
are  just  the  wavefront  sensor  slope  outputs.  Both  aniflcial  neural  networks  use  their  inputs  to 
calculate  deformable  minor  actuator  commands.  The  effects  of  training  are  examined  for  the  single 
layer  perceptron  neural  network. 

The  adaptive  optical  system  is  simulated  by  integrating  simulation  software  provided  by  the 
Phillips  Laboratory  Kinland  AFB,  NM  and  a  computer  model  of  the  neural  network  controller.  The 
simulation  code  conuins  adaptive  optical  system  models  and  an  atmospheric  phase  screen  generator. 

This  thesis  research  supports  the  feasibility  of  integration  of  aniflcial  neural  networks  with 
adaptive  optical  imaging  systems.  The  simulation  results  show  the  performance  of  aniflcial  neural 
network  reconstruaon  are  commensurate  with  conventional  linear  estimation  techniques.  The 
simulation  also  shows,  that  training  aniflcial  neural  networks  can  provide  performance  enhancements 
not  currently  achievable  using  conventional  techniques. 
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AN  INVESTIGATION  OF  THE  APPUCATION  OF 


ARTinCIAL  NEURAL  NETWORKS 
TO  ADAPTIVE  OPTICAL  IMAGING  SYSTEMS 


I.  Introduction 

1.1  Motivation 

The  traditional  Air  Force  mission,  *to  fly  and  fight*  is  becoming  increasingly  dependent  on 
peripheral  technologies.  High  resolution  ground  based  exo-atmospheric  imaging  technology  is  crucial 
to  successful  fulfillment  of  present  and  future  Air  Force  mission  profiles.  The  success  of  high 
resolution  ground  based  exo-atmospheric  imaging  hinges  on  the  solution  to  one  crucial  problem, 
negating  the  effects  of  atmospheric  turbulence. 

Atmospheric  turbulence  significantly  degrades  the  performance  of  exo-atmospheric  imaging 
systems.  Atmospheric  turbulence  produces  image  motion,  distortions,  and  intensity  fluctuations 
(scintillation)  (16:360).  There  are  several  methods  to  compensate  for  turbulence  induced  image 
degradation.  For  ground  based  applications,  the  most  common  methods  are  post  processing  and 
predetection.  Post  processing  compensates  for  image  aberrations  after  image  retrieval.  Predetection 
compensates  for  image  aberrations  in  real-time.  This  thesis  addresses  only  predeteaion  atmospheric 
imaging,  specifically  the  use  of  adaptive  optics. 

Predetection  or  adaptive  optics  uses  real-time  control  to  compensate  for  image  degradations 
encountered  when  imaging  through  turbulence.  Real  time  imaging  is  essential  to  the  exo-atmospheric 
tracking  and  possibly  weapons  delivery  aspects  of  the  Air  Force’s  mission  profile. 

The  adaptive  optics  system  performs  three  basic  functions:  wavefront  sensing,  reconstruction, 
and  compensation.  Wavefront  sensing  measures  the  turbulence  induced  phase  aberrations  of  the 
incoming  wavefront  across  the  system  aperture.  Wavefront  reconstruaion  uses  the  sensed  phase 
aberrations  to  calculate  the  deformable  mirror  actuator  commands.  Wavefront  compensation  receives 
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the  actuator  commands  and  positions  individually  controlled  actuators  on  a  deformable  mirror  to 
counteract  the  effects  of  turbulence. 

The  integration  of  the  artiOcial  neural  network  with  the  adaptive  optical  imaging  system 
occurs  in  the  wavefront  reconstruction  process.  This  thesis  research  develops  'artificial  neural 
network  reconstructors.'  The  artiflcial  neural  network  reconstructors  use  slope  sensor  measurements 
to  calculate  deformable  mirror  actuator  commands. 

The  Air  Force’s  Phillips  Laboratory  is  at  the  forefront  of  the  DOD’s  effort  to  explore  adaptive 
optics  as  a  means  to  compensate  for  the  effects  of  atmospheric  degradation  on  exo-atmospheric 
imaging  systems.  The  Phillips  Laboratory,  who  sponsors  this  thesis,  is  actively  investigating  the  use 
of  adaptive  optical  systems  to  image  exo-atmospheric  objects,  such  as  satellites. 

1.2  Approach 

The  goal  of  this  thesis  is  to  demonstrate  that  artificial  neural  networks  can  accomplish 
adaptive  optical  wavefront  reconstruction.  Artiffcial  neural  networks  have  many  features  that  can 
potentially  benefit  present  and  future  adaptive  optical  imaging  systems.  The  artificial  neural  network 
combines  the  features  of  a  highly  parallel  architecture  and  extremely  fist  computational  speed  to 
provide  significant  advantages  over  conventional  wavefront  reconstruction  techniques.  The  added 
feature  of  artificial  neural  network  training  can  provide  accurate  system  output  estimates  under  a  wide 
variety  of  system  operating  environments.  To  accomplish  the  research  goal,  the  following  tasks  must 
be  completed: 

1.  Determine  the  types  of  artificial  neural  networks  to  use  as  research  baselines. 

2.  Develop  artificial  neural  network  wavefront  compensation  algorithms  using  conventional 
adaptive  optics  control  techniques. 

3.  Implement  adaptive  optical  wavefront  compensation  using  the  artificial  neural  network 
compensation  algorithms. 

4.  Simulate  and  evaluate  the  system  performance  by  comparing  with  conventional  linear 
estimation  techniques. 
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5.  Demonstrate  the  potential  benefits  of  artificial  neural  networks. 

The  first  task  starts  with  a  delineation  between  the  types  or  classes  of  artificial  neural  networks  that 
are  candidate  wavefront  reconstruaors.  The  two  classes  of  neural  networks  researched  are  the 
feedforward  and  recurrent  artificial  neural  network.  This  thesis  analyzes  a  representative  artificial 
neural  network  from  each  class.  The  baseline  artificial  neural  networks  chosen  were  the;  Hopfieid 
neural  network  for  the  recurrent  class  and  the  single  layer  perceptron  for  the  feedforward  class. 

The  second  task  maps  the  adaptive  optics  wavefront  reconstruaion  process  into  each  neural 
network  state  space.  Recurrent  and  feedforward  neural  networks  have  unique  properties  which  govern 
their  implementations  as  wavefront  reconstructors.  Each  neural  network  is  studied  to  determine  the 
best  means  of  integration  with  the  adaptive  optical  control  system. 

The  third  task  develops  software  models  of  the  artificial  neural  network  wavefront 
reconstructor.  The  artifrcial  neural  network  models  are  integrated  with  a  software  simulation  package 
provided  by  the  Phillips  Laboratory,  Kirtland  AFB,  NM.  The  Phillips  Laboratory  simulation  package 
conuins  the  adaptive  optical  system  models  and  control  algorithms  for  conventional  adaptive  optics 
compensation.  Modifications  are  made  to  the  simulation  code  to  replace  the  conventional 
reconstructor  with  the  artificial  neural  network  reconstructor. 

For  the  fourth  task,  an  unmodified  version  of  the  simulation  code  is  the  comparison  baseline. 
The  unmodified  version  uses  a  conventional  linear  es’imation  technique  known  as  the  "least  squares 
reconstructor."  System  performance  comparisons  are  made  between  the  results  from  the  least  squares 
reconstructor  and  the  artificial  neural  network  reconstructor. 

The  final  task  demonstrates  the  potential  benefits  of  artificial  neural  network  compensation. 
The  feature  of  artificial  neural  network  training  is  used  to  show  how  artificial  neural  networks  can 
provide  performance  enhancements  not  currently  achievable  using  conventional  reconstruction 
techniques. 
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1.3  Scope 

This  thesis  is  the  initial  step  into  the  integration  of  artificial  neural  networks  and  adaptive 
optical  imaging  at  AFTT.  Several  estimates  and  assumptions  are  used  to  expedite  progress  and 
complete  the  research  tasks  in  the  allotted  time.  Most  of  the  estimates  and  assumptions  have  their 
roots  based  in  precedents  found  in  the  literature.  Some  of  the  assumptions  are  imposed  by  the 
software  simulation  code  and  others  are  pure  engineering  judgement 

This  thesis  assumes  that  the  adaptive  optical  imaging  system  is  a  ground  based  system.  A 
separate  system  tracks  the  objea  and  its  associated  reference  or  guide  star.  The  objea  and  guide  star 
are  in  close  proximity  to  one  another,  therefore  adaptive  optic  compensation  for  the  guide  star  also 
compensates  for  the  object.  A  separate  imaging  system  removes  the  overall  piston  and  tilt  of  the 
incoming  wavefront. 

The  simulation  code  produces  a  random  phase  screen  developed  fiom  a  technique  introduced 
by  Cochran  (4).  This  phase  screen  simulation  produces  phase  screens  whose  struaure  funaions,  when 
computed  over  a  large  ensemble,  corresponds  to  a  Kolmogorov  atmosphere  (13:3S9-391).  This 
simulation  code  also  contains  models  for  each  component  of  the  adaptive  optical  imaging  system.  The 
wavefront  sensor  model  is  patterned  after  the  Hartmann-Shack  sensor  (HS  WFS),  which  produces 
slope  outputs.  The  detection  process  produces  random  additive  noise  which  corrupts  the  slope 
outputs.  The  noise  and  signal  are  uncorrelated.  The  deformable  mirror  is  modeled  as  a  continuous 
face  sheet  mirror  with  Gaussian  aauator  influence  functions.  This  thesis  neglects  the  effects  of  a  time 
delay  between  the  wavefiont  sensing  and  correaion.  The  adaptive  optical  system  geometry  used  for 
this  thesis  has  20  wavefront  sensor  subapertures  and  21  deformable  mirror  actuators  (Figs  1.1,  1.2). 
This  particular  geometry  is  an  artifact  of  the  method  used  by  the  simulation  code  to  determine  the 
actuator  and  subaperture  locations.  The  operational  parameters  used  in  this  thesis  are  shown  in  Table 
I. 

The  Hopfield  neural  network  is  modeled  (Fig.  1.3)  as  a  discrete  time  system  with  synchronous 
updates  (32).  This  means  the  updates  to  the  Hopfield  outputs  and  inputs  are  made  at  the  same  time 
and  time  is  modeled  as  discrete  steps.  The  neuronal  outputs  of  the  Hopfield  network  correspond  to 
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Figure  1.1  Deformable  Mirror  Actuator  Geometry 


Figure  1.2  Hartmann  Wavefront  Sensor  Subaperture  Geometry 

the  individual  aauator  commands  to  the  deformable  mirror.  The  input  features  to  a  neuron  are  the 
weighted  slopes  from  the  wavefront  sensor  and  neighboring  actuator  feedback  commands  (Fig.  1.4). 
To  produce  an  output  command,  the  weighted  inputs  are  summed  and  passed  through  a  soft  limiter 
aaivation  function. 


1-5 


Tabk  I  System  Parameters 


Parameter 

Value 

Wavelength  (A) 

500  nm 

Actuator  Separation  (actsep) 

15  cm 

Mirror  Diameter  (DJ 

75  cm 

Light  Level  (Np) 

1-1000 

photoevents/subap/ 
integration  time 

Atmospheric  Coherence  Diameter  (r„) 

7.5,  10,  20  cm 

Actuator  Outints  (1-21) 


ngare  1.3  Hopfield  Neural  Network 

The  neuronal  outputs  of  the  single  layer  perceptron  (Fig.  1.5)  are  the  actuator  commands  to 
the  deformable  mirror.  For  the  single  layer  perceptron,  the  neuronal  inputs  are  the  weighted  slope 
sensor  outputs  (Fig.  1.6).  The  output  nonlinearity  or  activation  function  used  is  a  sigmoid  or 
'squashing  funaion.” 
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Actuator  Ootpata  (1*21) 


«l  «2  C2I 


Figure  1.5  Single  Layer  Perceptron  Artificiai  Neural  Network 


1.4  Standards 

This  thesis  analyzes  the  system  performance  in  response  to  variations  of  both  system  and 
hardware  parameters  of  the  adaptive  optical  imaging  system.  The  system  parameters  are  the  light 
level  and  seeing  conditions.  The  hardware  parameter  is  the  amount  of  interconneaions  from  the 
wavefront  sensor  to  the  wavefront  reconstructor. 
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Figure  1.6  Single  Layer  Perceptron  Neuron 

As  a  final  product,  this  thesis  contrasts  the  performance  between  the  conventional  least 
squares  reconstruaion  algorithm  and  the  artificial  neural  network  reconstructors.  The  primary  figure 
of  merit  used  is  the  mean  square  residual  phase  error  between  the  estimated  and  the  actual  wavefront. 


1.5  Chapter  Outlines 

The  following  is  a  brief  synopsis  of  the  information  found  in  each  of  the  thesis  chapters. 

1.5.1  Chapter  Z  Chapter  2  presents  the  fundamental  concepts  behind  adaptive  optical 
imaging  systems  and  artificial  neural  networks.  These  fundamental  concepts  are  essential  to 
understanding  the  problems  that  plague  the  adaptive  optical  imaging  system  and  a  possible  somution 
using  artificial  neural  networks.  The  concepts  presented  in  this  chapter  serve  as  the  basis  for  the 
algorithms  and  analysis  presented  in  chapter  3. 

1.5.2  Chapter  3.  This  chapter  uses  the  background  knowledge  provided  in  chapter  2  to 
develop  feedforward  and  recurrent  reconstructor  models.  Each  reconstruaor  model  is  then  validated 
as  being  either  a  Hopfield  or  a  single  layer  perceptron  artificial  neural  network. 
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1.5.3  Chapter  4.  This  chapter  presents  the  results  of  the  analysis  and  simulation.  The  task 
of  analyzing  and  simulating  the  adaptive  optical  imaging  system  is  an  intricate  process.  To  adequately 
address  each  speciHc  part,  the  entire  simulation  process  was  divided  into  specific  experiments.  Each 
experiment  and  the  results  are  discussed  in  detail. 

1.5.4  Chapter  5.  This  chapter  states  the  conclusions  based  upon  the  results  provided  in 
Chapter  4.  The  chapter  concludes  with  recommendations  for  areas  of  future  study. 

1.6  Summary 

The  adaptive  optics  imaging  system  presents  a  very  complex  control  system  problem.  A 
proposed  solution  is  the  use  of  an  artificial  neural  network  wavefront  reconstructor.  This  thesis 
investigates  the  application  of  two  different  types  of  artificial  neural  networks  (i.e.  Hopfleld  and  single 
layer  perceptron  artificial  neural  network)  to  the  adaptive  optical  imaging  system.  The  artificial 
neural  network  reconstruaor  performance  and  potential  performance  enhancements  using  artificial 
neural  network  training  are  presented. 
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II.  Background 


11  Introduction 

This  chapter  presents  the  fundamental  concepts  behind  adaptive  optical  imaging  systems  and 
artificial  neural  networks.  These  fundamental  concepts  are  essential  to  understanding  the  problems 
that  plague  the  adaptive  optical  imaging  system  and  a  possible  solution  using  artificial  neural 
networks.  The  concepts  presented  in  thesis  chapter  serve  as  the  basis  for  the  algorithms  and  analysis 
presented  in  the  next  chapter. 

This  chapter  provides  the  background  concepts  needed  to  obtain  a  fundamenul  understanding 
of  the  problem  (atmospheric  turbulence),  system  (adaptive  optical  imaging  system),  and  proposed 
solution  (artificial  neural  networks).  Prior  to  developing  artificial  neural  network  reconstructors,  the 
following  research  questions  must  be  answered: 

1)  What  is  the  origin  of  the  problem  ? 

2)  How  does  the  adaptive  optical  imaging  system  work  ? 

3)  How  does  the  problem  influence  the  system  ? 

4)  Is  there  an  alternate  solution  to  the  problem  ? 

5)  How  does  the  solution  work  ? 

6)  Has  the  solution  been  implemented  ? 

7)  How  will  this  research  effort  extend  the  present  implementation  ? 

This  chapter  sequentially  addresses  these  fundamenul  questions.  Research  question  1  is  answered 
in  seaion  2.2.  Section  2.2  provides  an  overview  of  the  concepts  of  the  most  important  problem  facing 
present  and  future  optical  imaging  systems;  negating  the  effects  of  atmospheric  turbulence  (research 
question  1).  Section  2.2  discusses  why  phase  only  correction  is  used  to  combat  the  effects  of 
atmospheric  turbulence.  Section  2.2  also  introduces  a  useful  measure  of  the  image  degradation  effects 
introduced  by  atmospheric  turbulence,  the  atmospheric  coherence  diameter  (r  J.  Section  2.3  answers 
research  question  2  by  defining  three  basic  functional  elements  (sensing,  reconstruction,  and 
compensation)  of  the  adaptive  optical  imaging  system  and  how  these  functional  elements  are  currently 
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implemented.  The  system  limiutions  imposed  by  atmospheric  turbulence  are  presented  in  section  2.4 
(research  question  3).  Research  question  4  is  addressed  in  section  2.5  by  presenting  a  brief  discussion 
of  how  artificial  neural  networks  represent  an  alternate  solution  to  the  problem.  The  theory  behind 
the  solution  (artificial  neural  networks)  is  presented  in  seaion  2.6  (research  question  5).  Research 
question  6  is  answered  in  section  2.7  by  discussing  the  current  implementations  of  the  artificial 
neural  network  solution.  This  chapter  concludes  by  presenting  the  direction  that  this  research  effort 
will  take  to  address  the  final  research  question. 

22  Turbulence 

This  section  discusses  the  origin  of  the  problem  (research  question  1)  facing  adaptive  optical 

imaging  system.  The  origin  of  the  problem  is  atmospheric  turbulence.  Atmospheric  turbulence 
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Figure  2.1  Turbulent  Eddy  Represenution  of  the  Atmosphere 

degrades  the  performance  of  optical  imaging  systems.  The  aberrations  produced  by  the  atmosphere 
are  caused  by  inhomogeneities  of  the  refractive  index  of  air.  Differential  heating  of  the  Earths’ 
surface  causes  large  scale  temperature  inhomogeneities.  These  large  scale  inhomogeneities  are  broken 
up  into  smaller  scale  "eddies’  by  convection  and  turbulent  wind  flow.  Each  turbulent  eddy  has  its  own 
individual  refractive  index  (13:388).  The  localized  refractive  indices  temporally  and  spatially  modulate 
the  amplitude  and  phase  of  a  propagating  wavefront.  Figure  2.1  graphically  displays  the  effect  of  the 
turbulent  eddies  on  plane  wave  propagation.  The  plane  wave  propagating  through  the  atmosphere 
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is  sensed  as  an  aberrated  wave  by  the  imaging  system.  The  aberrations  are  due  to  amplitude  and 
phase  modulations  induced  oy  atmospheric  turbulence. 

The  amplitude  modulation  effects  are  known  as  scintillation.  An  example  of  scintillation  is 
the  twinkling  of  stars  (34:224).  The  scintillation  effects  can  be  considered  negligible  when  the 
following  condition  is  satisfied  (36:2580) 

y/Lr<D^  (2-1) 


where  L  is  the  objects  distance  from  the  exit  pupil,  is  the  pupil  diameter,  and  A  is  the  wavelength. 
Although  this  condition  is  not  strictly  met  at  high  altitudes,  much  of  the  literature  suggests  that  the 
amplitude  modulation  effects  are  less  severe  than  the  phase  modulation  effects  for  ground  based 
vertical  imaging  through  the  atmosphere  (9)  (11)  (25)  (38).  Therefore,  by  only  compensating  for  this 
phase  aberration,  the  object  image  can,  in  theory  be  reconstruaed.  This  research  effort  focusses  only 
on  the  phase  compensation  aspect  of  adaptive  optical  systems. 


22/  Atmospheric  Coherence  Diame’er.  The  atmospheric  coherence  diameter  or  Fried 
parameter  is  a  useful  measure  of  the  image  degradation  caused  by  atmospheric  turbulence.  The 
atmospheric  coherence  diameter  is  a  measure  of  the  correlation  qualities  of  the  optical  wave 
propagating  through  the  atmosphere.  Typical  values  of  at  a  good  mountain  top  observatory  can 
range  from  5  cm  to  20  cm  (13:432). 

The  atmospheric  coherence  diameter  is  defined  by  (13) 
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where  A  is  the  wavelength,  c  is  the  zenith  angle.  C„'(z)  is  the  atmospheric  structure  constant,  and  z 
is  a  point  along  the  optical  path. 
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Subsequent  sections  will  show  how  the  atmospheric  coherence  diameter  is  intimately  related 
to  the  image  quality  and  resolution  of  the  adaptive  optical  imaging  system.  Seaion  2.2.2  shows  the 
expressions  for  the  atmospheric  optical  transfer  function  are  easier  to  understand  and  simpler  in  form 
when  put  in  terms  of  the  atmospheric  coherence  diameter.  Section  2.2.3  shows  the  resolution  of  an 
adaptive  optical  imaging  system  is  directly  related  to  the  ratio  of  the  systems  aperture  diameter  and 
the  atmospheric  coherence  diameter. 

222  Image  Quality.  A  performance  measure  of  an  imaging  system  operating  in  the  presence 
of  the  atmosphere  is  the  average  optical  transfer  funaion  (OTF)  (28).  The  average  OTF  is  used 
because  the  turbulence  is  a  random  process  in  time  and  space  and  the  ensemble  or  average  quantities 
are  of  interest  Comparison  between  the  system  optical  transfer  function  and  the  diffraction  limited 
transfer  function  gives  an  excellent  measure  of  the  system  performance  and  image  quality. 

The  Rytov  solution  for  the  propagation  of  an  electromagnetic  wave  in  a  homogeneous, 
isotropic  medium  with  zero  mean  Gaussian  random  phase  statistics  deflnes  the  long  exposure  OTF 
(13).  The  long  exposure  OTF  of  the  mediem  is  a  function  of  the  atmospheric  phase  structure 
function.  The  expression  for  the  phase  structure  function,  D^,  within  the  inertial  subrange  is  given 
by  (13) 

D.(r)  -  6.88  (L)"  (2.3) 

•  r 

where  r  is  the  spatial  separation  between  two  points  on  the  object  wavefront  and  r^  is  the  atmospheric 
coherence  diameter.  The  long  exposure  OTF  of  the  atmosphere  can  be  shown  to  be 


The  overall  OTF  of  the  optical  system  and  the  atmosphere  is  the  product  of  the  OTF  of  the  optical 
system  and  the  average  OTF  of  the  atmosphere.  The  overall  OTF  and  it  Fourier  transform  or  the 
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point  spread  funaion,  defines  the  image  quality  of  the  adaptive  optical  imaging  system.  Equations 
2.2  and  2.3  exemplify  the  dependence  of  image  quality  on  the  atmospheric  coherence  diameter. 


2.13  Resolution.  The  previous  section  pointed  out  the  effect  of  the  atmospheric  coherence 
diameter  on  the  imaging  qualities  of  the  adaptive  optical  imaging  system.  This  seaion  extends  the 
analysis  of  the  atmospheric  coherence  diameter  by  examining  its  effects  on  the  system’s  resolution  (K). 

The  diffraaion  limited  resolution  between  two  point  sources  is  represented  by  the 
relationship  (12:130) 


-  1-22 


A  d. 

i 


(2.5) 


where  d^  is  the  distance  from  the  aperture  to  the  image  plane.  Turbulence  effects  significantly  reduces 
the  resolution  from  that  given  in  equation  2.5.  A  useful  measure  of  the  resolution  atuinable  is  based 
upon  the  coherence  diameter  to  aperture  diameter  ratio. 


Figure  2.2  Normalized  Resolution  vs  Normalized  Diameter 


Figure  2.2  (13:431)  shows  a  plot  of  the  normalized  resolution  of  an  imaging  system  versus  the 
normalized  aperture  diameter.  X  and  X.^  are  the  resolution  and  the  maximum  resolution  attainable 
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by  the  system.  is  the  aperture  diameter  and  r^  is  the  atmospheric  coherence  diameter.  The  dashed 
line  represents  a  diffraction  limited  imaging  system,  and  the  solid  line  represents  a  system  subjected 
to  atmospheric  turbulence  effects.  As  figure  2.2  shows,  the  resolution  of  systems  subjected  to 
atmospheric  turbulence  effects  is  significantly  altered.  When  the  aperture  diameter  is  larger  than  r„, 
the  maximum  resolution  achievable  by  the  imaging  system  is  no  longer  a  function  of  the  aperture 
diameter.  Instead,  the  maximum  resolution  is  limited  by  the  ratio  of  the  aperture  diameter  to  the 
atmospheric  coherence  diameter.  As  the  aperture  diameter  to  r,  ratio  increases  beyond  unity,  larger 
aperture  diameters  offer  no  significant  improvement  in  resolution.  The  resolution  of  larger  aperture 
systems  coincides  with  the  resolution  of  a  system  that  has  an  aperture  diameter  equal  to  r^ 

To  exemplify  this  phenomena,  under  identical  operating  conditions,  the  maximum  resolution 
of  a  3  m  telescope  subjeaed  to  atmospheric  turbulence  is  comparable  a  same  telescope  with  an 
aperture  equal  to  r^  (5  to  20  cm). 

13  Adaptive  Optical  Imaging  Systems 

The  primaiy  function  of  an  adaptive  optics  imaging  system  is  to  provide  real-time  imagery  of 
exo-atmospheric  objects,  such  as  satellites.  The  generic  adaptive  optics  imaging  system  (Fig  2.3)  must 


Figure  Generic  Adaptive  Optical  Imaging  System 
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accomplish  three  basic  functions  to  negate  the  effects  of  atmospheric  turbulence.  The  three  functions 
are  to  sense,  reconstruct,  and  compensate  for  phase  distortions  induced  by  atmospheric  turbulence. 
This  section  answers  research  question  2  by  briefly  discussing  the  three  basic  functions  of  the  adaptive 
optical  imaging  system. 

Z3.1  Wavefront  Sensing.  Assuming  the  overall  wavefront  tilt  is  removed,  the  wavefront  sensor 
now  measures  the  residual  phase  gradients  present  in  the  incoming  wavefront.  A  wavefront  sensor 
senses  the  deviation  of  the  incoming  wavefront  from  an  ideal  plane  wave.  The  wavefront  sensor 
samples  the  incoming  wavefront  at  intervals  known  as  *subapertures'  (Fig  1.2).  Each  subaperture 
senses  the  localized  slope  deviation  of  the  incoming  wavefront.  The  two  most  common  wavefront 
sensors  are  the  shearing  interferometer  and  the  Hartmann-Shack  wavefront  sensor  (Fig  2.4).  A  model 
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Figure  2.4  Hartmann-Shack  Wavefront  Sensor 
of  the  Hartmann-Shack  wavefront  sensor  is  used  for  this  thesis. 


13.2  Wavefront  Reconstruction.  Wavefront  reconstruaion  receives  the  slope  sensor 
information  and  calculates  the  aauator  commands  needed  to  negate  the  phase  aberrations  induced 
by  atmospheric  turbulence.  The  wavefront  sensing  process  gives  slope  sensor  measurements  that  are 
a  linear  funaion  of  the  turbulence  induced  phase  aberration.  Due  to  this  linear  relationship  between 
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the  slope  sensor  measurements  and  the  phase  aberrations,  conventional  linear  estimation  techniques 
are  used  to  reconstruct  the  object  wavefront  (6:1).  The  two  most  common  linear  estimation 
techniques  are  termed  the  least  squares  and  the  minimum  variance  reconstructors  (28:1).  This  thesis 
uses  the  least  squares  reconstructor  as  the  comparison  baseline. 

13.3  Wavefront  Compensation.  The  deformable  mirror  receives  actuator  commands  from  the 
wavehront  reconstructor  to  provide  a  phase  conjugate  of  the  wavefront  phase  aberrations.  The  mirror 
uses  individually  controlled  aauators  to  make  a  piecewise  approximation  of  the  phase  conjugate 
wavefront  surface.  The  2  most  common  types  of  deformable  mirror  architectures  are  the  segmented 
mirror  and  continuous  face  sheet  mirror  (Fig  2.5). 

The  segmented  mirror  has  been  used  for  atmospheric  optical  compensation  for  many  years. 
Segmented  minors  can  compensate  for  piston  component  of  the  aberrated  wave  or  piston  and  tilt. 
In  a  segmented  minor,  the  actuator  phase  compensation  at  one  segment  is  not  affected  by  actuator 
control  of  any  other  segment  Early  integration  of  adaptive  optical  imaging  and  artificial  neural 
networks  used  segmented  minors  (31:19). 


Figure  2J  Deformable  Minor  Geometries 
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Continuous  facesheet  minors  use  a  thin  quartz  or  glass  fisicesheet  supported  on  arrays  of 
discrete  actuators.  These  discrete  actuators  deform  or  put  a  *dimple'  in  the  mirror  surface  to  provide 
a  conjugate  phase  correction.  This  thesis  uses  a  continuous  Eacesheet  mirror  as  a  baseline.  In  a 
continuous  facesheet  mirror,  any  given  actuator  command  influences  the  mirror  surface  over  the  entire 
mirror  diameter.  This  thesis  models  this  effect  by  using  Gaussian  actuator  influence  functions.  The 
actuator  influence  funaions  are  defined  at  each  actuator  center  and  any  phase  compensation  point 
is  found  to  be  a  combination  of  all  the  aauator  influences  summed  at  that  point. 

Z4  Fundamental  Limitations  on  the  Adaptive  Optical  Imaging  System 

The  atmosphere  imposes  fundamental  limitations  on  the  adaptive  optical  imaging  system 
(research  question  3).  These  limitations  can  be  loosely  grouped  as  limitations  on;  the  computational 
frame  time  of  the  adaptive  optical  imaging  system,  the  image  quality,  resolution,  and  the  light  level. 
This  section  presents  a  discussion  of  the  limitations  imposed  on  the  adaptive  optical  imaging  system 
by  atmospheric  turbulence. 

14.1  Computational  Frame  Time.  The  turbulent  nature  of  the  atmosphere  produces 
wavefront  phase  aberrations  which  exhibit  a  high  degree  of  temporal  instability.  This  temporal 
instability  imposes  a  fundamental  limitation  on  ihc  computational  frame  time  for  adaptive  optica! 
compensation.  The  adaptive  optics  system  must  sense  and  compensate  for  the  wavefront  aberrations 
within  the  atmosphere’s  coherence  time  (rj.  The  atmospheres  coherence  time  is  the  time  delay  that 
a  wavefront  and  a  temporally  delayed  version  of  the  wavefront  can  undergo  and  still  be  highly 
correlated  (i.e.  temporally  stable)  (13:138).  The  adaptive  optics  system  must  have  a  closed  loop 
control  bandwidth  of  at  least  100  Hz  to  guarantee  temporal  stability  of  the  object  wavefront.  This 
closed  loop  bandwidth  results  in  a  step  response  settling  time  of  less  than  10  milliseconds  (7:260) 
(8:224).  The  millisecond  frame  rate  requirement  for  real-time  operation  coupled  with  the  large 
number  of  control  channels  required  for  adaptive  optical  imaging  systems  imposes  severe  restrictions 
for  current  and  future  adaptive  optics  imaging  systems. 


The  conventional  wavefront  reconstruction  process  is  a  computationally  intense  environment. 
The  number  of  active  control  channels  and  the  computational  frame  time  limitations  make  the 
reconstruction  process  difficult  to  implement  without  sophisticated  and  extremely  fast  hardware  and 
software.  Coupling  the  magnitude  of  the  control  problem  with  the  variational  nature  of  faaors 
external  to  the  adaptive  optics  imaging  system,  make  the  task  of  exo-atmospheric  imaging  quite 
formidable. 


Z4.2  Light  Level  The  photon  count  per  subaperture  or  *light  level"  is  another  fundamental 
limitation  on  the  adaptive  optical  imaging  system.  The  detectability  of  the  wavefront  phase 
aberrations  is  directly  related  to  the  amount  of  light  emanating  from  the  object  At  very  low  light 
levels,  shot  noise  effects  in  the  wavefront  sensor  can  make  attempts  to  extract  an  image  virtually 
impossible.  The  wavefront  sensor  outputs  are  modeled  as  a  signal  plus  additive  noise  measurement. 
The  additive  noise  or  "slope  measurement  noise"  is  due  to  the  shot  noise  effects  of  the  detection 
process  and  is  assumed  to  be  uncorrelated  with  the  wavefront  phase.  The  noise  is  modeled  as  a  zero 
mean  Gaussian  random  process,  where  the  standard  deviation  of  the  slope  measurement  noise  a,  is 
defined  by 


a 


0.86  ir  fj 


0.74  ir  r, 


D  >r 


D  <  r 


O  O 


(2.6) 


where  n  is  a  correaion  faaor  which  compensates  for  the  finite  sized  detectors  separated  by  unusable 
space  and  Np  is  the  total  subaperture  photon  count.  The  value  of  n  is  assumed  to  be  1.5.  This  value 
corresponds  to  typical  values  found  for  charge  coupled  device  technology  (38:1919).  Equation  2.6 
points  out  the  inverse  relationship  of  the  additive  noise  to  the  light  level.  As  the  light  level  increases 
the  variance  of  the  slope  measurement  noise  decreases. 
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25  An  Alternate  Solution 

Research  question  4  asks  *Is  there  an  alternate  solution  to  the  problem?’  Recently,  Lockheed 
presented  an  alternate  solution  to  alleviate  some  of  the  speed  and  interconnection  problems 
encountered  with  conventional  analog  and  digital  wavefront  reconstructors  (7:260-268)  (8:222-229). 
This  solution  integrated  artifrcial  neural  networks  into  the  wavefront  reconstruaion  process.  The  data 
rate  of  the  artificial  neural  network  reconstructor  (5000  Hz)  is  almost  an  order  of  magnitude  faster 
than  a  conventional  STAR  array  processor  compensation  method  (600  Hz).  Artificial  neural  networks 
can  also  reduce  the  complex  interconnection  schemes  encountered  with  larger  deformable  mirror  and 
wavefront  sensor  arrays.  Although  this  integration  of  artificial  neural  networks  and  adaptive  optics 
is  still  in  the  early  stages  of  development,  the  preliminary  results  are  quite  promising. 

16  Art^ial  Neural  Networks 

This  section  discusses  the  operational  charaaeristics  of  artificial  neural  networks  (research 
question  5).  The  generic  anificial  neural  network  is  a  highly  parallel,  distributed  processing  structure 
that  consists  of  simple  processing  elements.  Each  processing  element,  or  neuron  has  a  single  output 
that  may  branch  out  to  many  peripheral  conneaions.  The  output  signal  can  have  any  funaional  form, 
but  must  depend  on  the  current  values  at  the  input  of  the  neuron.  Neural  networks  have  many 
names,  such  as  conneaionist  models,  distributed  processing  models,  or  neuromorphic  systems  (23:4). 
Whatever  the  nomenclature,  all  of  these  models  attempt  to  provide  performance  enhancements  in  a 
calculation  intense  environment  by  using  densely  packed,  highly  parallel  processing  elements.  Two 
of  the  basic  charaaeristics  used  to  describe  artificial  neural  networks  are  the  type  of  network 
(feedforward/recurrent)  and  the  learning  technique  (21:ill-221)  (27:39-40). 

16.1  Recurrera  and  Feedforward  Neural  Networks.  Feedforward  neural  networks  (Fig.  2.6) 
operate  on  the  sum  of  weighted  inputs  using  an  aaivation  funaion  or  nonlinearity  to  calculate  an 
output  For  single  layer  feedforward  networks  (Fig.  1.5)  the  aaivation  funaion  outputs  are  the  system 
outputs.  For  multiple  layer  feedforward  neural  networks  (Fig.  26)  the  aaivation  funaion  outputs 
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Figure  2.6  Multilayer  Feedforward  Artificial  Neural  Network 
become  weighted  inputs  to  the  next  layer  of  neurons,  and  so  on,  until  the  output  layer  produces  the 
system  output  Outputs  in  this  type  of  neural  network  are  changed  only  when  a  new  input  exemplar 
is  presented.  The  weights  in  these  type  of  networks  are  often  trained  to  produce  desired  outputs.  The 
feedforward  artificial  neural  network  studied  for  this  thesis  is  the  single  layer  perceptron. 

A  recurrent  neural  network  (Fig.  2.7)  is  a  neural  network  that  has  feedback  paths  itom 
neuronal  outputs  to  neuronal  inputs.  The  output  response  of  these  networks  is  dynamic.  The  outputs 
continually  changing  until  the  network  converges  to  reach  a  stable  value.  An  input  is  presented  to 
the  network  then  an  output  is  calculated.  The  outputs  are  then  fed  back  into  the  network  as  inputs. 
This  process  is  repeated  until  a  suble  condition  is  reached.  Stability  can  be  defined  as  convergence 
to  a  single  output,  as  in  the  case  of  a  Hopfieid  neural  network.  Convergence  can  also  be  defined  as 
a  predetermined  time  in  the  future,  as  in  the  case  of  a  recurrent  backpropagation  neural  network. 
The  recurrent  artifidal  neural  network  examined  for  this  thesis  are  the  Hopfieid  neural  network,  and 
an  extension  of  the  Hopfieid  network  that  ukes  advanuge  of  the  learning  properties  of  artificial 
neural  networks. 

262  Art^ial  Neural  Network  Learning.  Artificial  neural  networks  can  be  trained  to  learn 
relationships  between  pattems/exemplars.  Learning  is  the  encoding  of  information,  a  system  can  learn 
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Figure  2.7  Recurrent  Artificial  Neural  Network 

a  pattern  by  encoding  information  about  that  pattern.  Subjecting  a  neural  network  to  many  patterns 
allows  the  network  to  generalize  about  the  interrelationship  between  the  patterns.  These 
interrelationships  are  formed  by  statistical  process  inference  or  class  relationship  between  the 
ensemble  of  patterns.  The  ways  in  which  artificial  neural  networks  learn  are  described  as  being  either 
unsupervised,  supervised,  or  a  combination  of  both. 

In  unsupervised  training  the  training  feature  data  set  is  given  to  the  network.  The  data  along 
with  the  training  algorithm  position  the  weights  into  clusters  that  reflect  the  distribution  of  the 
training  dau  (40:2-9).  The  unsupervised  training  algorithm  uses  the  dau  to  approximate  the 
probability  distribution  of  the  ensemble  data  set.  The  inference  of  the  probability  distribution  is  used 
to  classify  subsequent  test  data. 

Supervised  learning  presents  the  neural  network  with  an  ensemble  of  training  data  pairs 
composed  of  input  data  features  and  their  desired  output  The  figure  of  merit  is  the  difference 
between  the  neural  network  calculated  output  and  the  desired  output  The  interconneaion  weights 
are  then  trained  to  minimize  some  funaion  of  this  error.  Typically,  the  weights  are  trained  to  reduce 
the  mean  squared  error  between  the  calculated  and  desired  outputs.  The  training  process  is  repeated 
over  many  training  pairs  to  allow  the  network  to  learn  the  statistical  relationship  binding  the  entire 
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data  set  This  thesis  addresses  a  specific  supervised  training  technique  known  as  the  backpropagation 
method  (17:593)  (39:270-280).  The  backpropagation  training  method  will  be  applied  to  the  single 
layer  perceptron  and  the  Hopfield  artificial  neural  networks. 

16.3  The  Hopfield  Neurcd  Network.  The  Hopfield  neural  network  or  relaxation  neural 
network  Is  the  network  of  choice  for  many  hardware  implementations.  The  reasons  are  the 
conventional  Hopfield  neural  network  has  fixed  interconnection  weights  (27:78-82)  and  extremely  fast 
convergence  properties.  For  these  reasons,  the  Hopfield  neural  network  is  the  baseline  recurrent 
neural  network  used  for  this  thesis. 

The  Hopfield  neural  network  has  been  applied  to  many  classes  of  problems.  These  classes 
can  be  loosely  classified  by  the  type  of  problem  the  neural  network  is  attempting  to  solve.  These  two 
application  groups  are  the  associative  memory  (1:204)  and  funaion  estimation  (10:805-810)  (41:499- 
506)  (33:533-541)  applications.  This  thesis  diverges  from  the  associative  memory  applications  and 
applies  the  Hopfield  neural  network  to  a  classical  optimization  problem,  adaptive  optical  wavefront 
reconstruaion.  The  operational  concept  of  the  Hopfield  neural  network  begins  by  presentation  of 
an  input  to  the  network.  The  network  continually  operates  on  the  weighted  sum  of  inputs  and 
feedback  outputs,  until  the  network  converges  to  a  stable  state.  The  Hopfield  neural  network  can 
processes  the  weighted  sums  by  passing  them  through  a  nonlinear  funaion  such  as  a  hard  limiter  or 
sigmoid  (squashing  funaion).  This  thesis  uses  a  soft  limiter  or  ramp  nonlinear  funaion  to  calculate 
neuronal  outputs.  When  using  Hopfield  neural  networks,  fundamental  issues  must  be  addressed  to 
adequately  predia  and  chaiaaerize  the  systems  performance.  The  most  important  issues  are  to  a 
Hopfield  neural  network  designer  are  convergence  subility  and  local  minima  (Fig  2.8). 

Convergence  stability  posed  a  formidable  problem  to  early  researchers.  It  was  very  hard  to 
predia  which  networks  would  be  stable.  This  problem  continued  until  fundamental  work  was 
accomplished  by  Cohen  and  Grossberg  to  chaiaaerize  the  convergence  stability  for  this  specific  class 
of  recurrent  neural  networks.  The  dynamic  Hopfield  neural  network  system  possesses  convergence 
stability  if  there  is  a  Lyapunov  funaion  that  describes  the  performance.  The  existence  of  a  Lyapunov 
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function  proves  stability,  whereas  the  nonexistence  of  the  Lyapunov  function  does  not  infer  anything 
about  system  stability  (21:69-73).  The  Lyapunov  function  of  the  HopGeld  neural  network  is  often 
termed  the  'energy*  funaion  (18:3089-3090).  Examination  of  the  gradient  of  the  energy  function  with 
respect  to  the  system  outputs  can  prove  convergence  subility.  The  energy  function  of  the  HopGeld 
neural  network  is  intimately  dependent  upon  the  properties  of  the  feedback  weight  interconnection 
matrix.  Exhaustive  analysis  of  the  convergence  issues  has  resulted  in  two  requirements  for  the 
feedback  interconneaion  weight  matrix  (5:815-826).  The  Grst  is  there  is  no  self  feedback  for  the 
HopGeld  neurons,  or  the  weight  from  the  i***  to  the  i"*  neuronal  element  is  zero.  The  Gnal  restriaion 
imposed  by  convergence  stability  is  that  the  feedback  interconnection  weight  matrix  must  be  symmetric 
about  the  main  diagonal,  or  the  weight  from  the  i*"  to  the  neuronal  element  equals  the  weight  Gom 
the  to  the  i*^  neuronal  element. 

When  an  energy  surface  becomes  complicated  and  has  many  peaks  and  valleys,  local  minimas 


Wil^ 


Figure  2.8  Local  Minimum 

become  a  signiGcant  concern  (22:137)  (30:327).  Issues  of  numerous  local  minima  plagued  early 
researchers  using  HopGeld  neural  networks.  Local  and  global  minimas  represent  stable  solutions  in 
the  energy  state  space  for  this  class  of  neural  networks.  When  a  HopGeld  neural  network  converges. 
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it  settles  into  a  miniinuin  on  the  energy  surface.  Qjnvergence  to  a  local  minima  represents  an 
incorrect  solution. 

Z6.4  Single  Layer  Perceptrons.  The  single  layer  perception  artificial  neural  network  was 
originally  proposed  by  Rosenblatt  in  1959  (27:47).  The  single  layer  perceptron  artificial  neural 
network  and  a  single  perceptron  are  shown  in  Fi^.  1.5  and  1.6.  The  inputs  to  the  perceptron  can  be 
from  other  perceptrons  or  the  external  sources  (e.g.  slope  sensor  outputs).  Each  input  is  multiplied 
by  a  synaptic  weight  W|j  and  summed.  A  bias  term  (^i)  is  added  and  a  nonlinear  transformation  is 
applied.  There  are  several  forms  for  the  nonlinearity,  for  example  a  hard  limiter,  a  linear  ramp,  or 
a  sigmoid.  The  bias  term  can  also  be  treated  as  another  weight  multiplied  by  a  constant  of  one.  The 
single  layer  perceptron  uses  the  weights  to  map  the  vector  of  inputs  onto  a  vector  of  outputs. 

The  key  to  function  estimation  using  perceptrons  is  using  a  learning  algorithm  to  find  the 
correct  set  of  weights.  The  most  common  perceptron  learning  algorithm  is  called  1>ackward  error 
propagation,'  or  simply  *backprop.'  Backprop  calculates  the  error  at  the  output  layer  and 
backpropagates  the  error  through  the  network  correaing  the  weights.  The  correaion  represents  a 
gradient  descent  down  the  error  surface  toward  a  minimum.  Perceptron  learning  using  backprop  is 
an  automatic  way  of  setting  the  weights  and  thresholds.  Training  the  perceptron  to  actual  data  allows 
the  inference  of  the  probability  distribution  of  the  data.  Once  the  probability  distribution  is  inferred, 
subsequent  presentation  of  unseen  inputs  allows  the  perceptron  to  generalize  and  estimate  an  answer 
based  upon  inferred  probability  distribution. 


2  7  Integration  of  Adaptive  Optics  with  Artificial  Neural  Networks 

This  seaion  answers  research  question  6  by  describing  how  the  previous  implemenutions  of 
artiflcial  neural  networks  operated,  the  results  of  these  implemenutions,  and  the  limiutions  of  these 
implemenutions.  The  earliest  forms  of  the  adaptive  optical  system  control  were  realized  using 
networks  of  operational  ampliflen  and  resistors  which  achieved  the  desired  solution  during  steady 
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state  conditions  (14:669).  These  implementations  were  a  rudimentary  form  of  the  Hopfield  artificial 
neural  network  (31:20).  Recently,  Lockheed  presented  further  applications  of  artificial  neural 
networks  to  the  adaptive  optical  imaging  system. 

Z7.1  Lxxkheed  Programmable  Analog  Neural  Network.  An  analog  neural  network  breadboard 
consisting  of  256  neurons  and  2048  programmable  synaptic  weights  has  been  construaed  and  tested 
(7:260-268)  (8:222-229).  This  analog  neural  network  implements  Hudgin’s  recursive  algorithm  for 
optimal  wavefront  estimation  using  an  array  of  noisy  wavefront  slope  measurements.  The  neural 
network  finds  the  optimal  control  matrix  to  command  the  deformable  mirror  actuators.  The  actuators 
deform  the  mirror  surface  to  compensate  for  the  phase  aberrated  wavefront.  The  current 
implementations  are  an  extension  of  work  previously  accomplished  by  Smithson  in  1987  (31).  In  1987, 
Smithson  began  his  studies  into  the  integration  of  artificial  neural  network  with  adaptive  optics. 
Smithson  noticed  that  the  analog  feedback  network  implementation  of  the  adaptive  optics  control 
algorithm  developed  by  Hudgin  was  a  special  case  of  the  Hopfield  neural  network.  The 
implementation  of  the  Hudgin  algorithm  used  fixed  weights  on  the  feedback  actuator  voltages 
characteristic  of  the  Hopfield  neural  network  applied  to  continuous  work  functions. 

Smithsons’  implementation  was  followed  by  work  accomplished  by  Fisher  and  others.  Fisher 
used  Hudgin’s  algorithm  to  integrate  artificial  neunl  networks  with  an  adaptive  optics  system.  Fishers 
contributions  was  a  demonstration  of  the  ability  for  open  and  closed  loop  control  (7:260-268)  (8:222- 
229)  of  a  segmented  deformable  mirror  array. 

The  open  loop  control  consisted  of  training  the  artificial  neural  networks  using  actual 
actuator  command  signals  as  the  training  veaor  with  no  feedback  between  actuator  commands. 
Fisher’s  open  loop  control  implementation  is  equivalent  to  a  single-layer,  feed-forward  artificial  neural 
network.  Fisher’s  training  used  a  perceptron  type  training  algorithm  to  determine  the  best  weights 
for  the  input  slope  sensor  values.  The  open  loop  study  was  then  extended  to  allow  actuator  command 
feedback,  in  this  case  both  the  input  and  feedback  weights  were  trained  using  an  algorithm  similar  to 
the  backpropagation  training  algorithm.  The  closed  loop  control  used  actuator  command  feedback 
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and  adapted  the  system  to  optical  component  drift  or  settling  due  to  thermal  effects.  For  closed  loop 
control,  the  loop  was  opened  at  3  second  intervals  and  system  parameters  were  sampled.  Based  upon 
the  sampled  parameters  the  input  and  output  feedbac!  artificial  neural  networks  weights  were 
adjusted. 

The  basis  for  the  previous  applications  of  artificial  neural  networks  to  the  adaptive  optical 
imaging  system  all  relied  upon  the  recursive  algorithm  developed  in  1977  by  Richard  H.  Hudgin 
(19:373-378).  This  algorithm  uses  a  large  array*  assumption.  The  algorithm  also  implicitly  assumes 
that  there  is  no  significant  interaaion  between  the  mirror  aauators.  These  assumptions  result  in  a 
convolutional  form  for  a  'perfect  reconstruaor*  that  results  in  a  recursive  algorithm.  This  recursive 
algorithm  showed  that  an  estimate  of  the  actual  wavefront  phase  is  a  linear  combination  of  the  four 
'nearest  neighbor  aauator*  phase  and  phase  difference  values.  The  shortfall  of  this  assumption  is  that 
it  does  not  allow  for  edge  effects  encountered  at  the  mirror  periphery.  Due  to  the  noninteraction 
between  deformable  mirror  commands,  this  algorithm  is  well  suited  to  describe  the  operation  of  a 
segmented  mirror,  but  is  deficient  when  attempting  to  explain  the  operation  of  a  continuous  minor. 

Z8  Summary 

The  current  adaptive  optic  imaging  systems  may  not  achieve  the  high  data  processing 
throughput  requirements  necessitated  by  future  military  image  resolution  requirements.  The  high 
degree  of  parallelism  of  neural  networks  has  resulted  in  significant  increases  in  the  throughput,  and 
may  offer  the  solution  to  the  computational  frame  time  limitations  encountered  by  adaptive  optic 
imaging  systems. 

To  extend  the  previous  implementations  of  artificial  neural  network  reconstructors,  this  thesis 
develops  a  two  alternate  techniques  that  allow  full  control  of  all  the  actuators  on  a  continuous 
facesheet  deformable  mirror  (research  question  7).  a  recursive  technique  is  developed  that  is  well 
suited  for  a  Hopfield  artificial  neural  network.  A  feedforward  technique  is  developed  for  a  single 
layer  perception  artificial  neural  network  architeaure.  Performance  of  the  artificial  neural  network 
reoonstructors  is  analyzed  in  response  to  seleaed  parametric  variations.  The  parameters  that  are 
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varied  are  the  light  level  and  atmospheric  coherence  diameter.  The  use  of  artificial  neural  network 
training  is  used  to  examine  the  performance  enhancements  achievable  using  artificial  neural  networks. 
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HI.  Theory 


3.1  Introduction 

The  previous  chapter  provided  background  knowledge  of  the  problem  (atmospheric 
turbulence),  the  system  (adaptive  optical  imaging  system),  and  artificial  neural  networks.  This  chapter 
uses  the  background  knowledge  to  develop  artificial  neural  network  reconstructors.  This  chapter 
outlines  the  approach  taken  to  develop  artificial  neural  network  reconstruaors.  The  basic  approach 
consisted  of  the  following  research  steps; 

(1)  Define  the  fundamental  adaptive  optical  system  models. 

(2)  Develop  the  comparison  baseline  (least  squares  reconstructor)  model. 

(3)  IDevelop  the  reconstructor  models  that  are  implemented  using  artificial  neural  networks. 

(4)  Validate  the  reconstructor  models  by  mapping  the  models  into  conventional  artificial 
neural  network  state  spaces. 

This  chapter  is  developed  by  sequentially  addressing  each  research  step.  Section  3.2  satisfies  research 
steps  1  and  1  Section  3.2  serves  three  primary  purposes.  The  first  purpose  is  to  present  the 
fundamental  adaptive  optical  imaging  system  models.  The  second  purpose  is  to  derive  the  comparison 
baseline  model  (least  squares  reconstructor).  The  final  purpose,  and  perhaps  the  most  important  is 
to  present  the  primary  design  equation  (Eq  3.9)  used  to  develop  the  artificial  neural  network 
reconstructors.  Research  step  3  is  addressed  in  section  3.3.  Section  3.3  develops  the  recurrent  and 
feedforward  reoonstruaor  models.  Research  step  4  is  satisfied  in  seaion  3.4.  Section  3.4  presents 
the  conventional  state  space  models  for  the  artificial  neural  networks.  The  models  developed  in 
section  3.3  are  then  mapped  into  the  artificial  neural  network  sute  spaces. 

3.2  The  Adaptive  Optics  System  Models 

3.Z1  General  System  Development.  This  section  satisfies  research  steps  1  and  2.  First,  the 
fundamental  adaptive  optical  imaging  system  models  are  developed.  Next,  the  development  of  the 
comparison  baseline  (least  squares)  reconstruaor  is  presented.  This  seaion  also  presents  the  primary 
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design  equation  (Eq  3.9)  for  the  artificial  neural  network  reconstruaors. 

The  adaptive  optical  system  analysis  closely  follows  the  work  done  by  Wallner  (35;  1771-1776), 
Welsh  (38:1913-1923),  and  Roggemann  (28:1-20).  The  analysis  begins  with  the  definition  of  the  zero 
mean  or  piston  removed  wavefront  phase  ^(x,t),  where  x  represents  a  two  dimensional  position  vector 
within  the  system  pupil  and  t  is  time.  The  piston  removed  phase,  ^(x,t),  is  related  to  the  phase  of 
the  incident  wave  front,  ^x,t),  by 

^(xj)  -  i>(xj)  -  »^(r)  t^(£a)  d^x  (3.1) 

where  W;^(x)  is  the  aperture  weighting  function.  The  aperture  weighting  function  is  assumed  to  satisfy 
the  following  relationship 

“u®  <^1  ■  I 

This  thesis  assumes  a  circular  aperture.  A  circular  aperture  results  in  an  aperture  weighting  funaion 
of  the  form 


1 


-  0 


(3.3) 


where  is  the  aperture  diameter. 

The  piston  and  tilt  removed  phase,  7(x,t),  is  represented  by  the  difference  between  the  piston 
removed  phase  and  the  tilt  component 
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i(y)  -  ^(i^)  -  £(0  ‘i 


(3.4) 


where  the  wavefront  phase  tilt  component,  a(t)  is  given  by  (28:6) 

a{t)  ~  ^  (3.5) 


The  analysis  and  simulation  assumes  that  the  piston  and  tilt  component  of  the  phase  are  removed. 

A  wavefront  sensor  is  used  to  sense  the  wavefront  phase  7(x,t).  The  output  of  the  wavefront 
sensor  is  modeled  by  (38:1914) 

^rt(£)  •  </(£)]  d^x  +  a„(t)  (3.6) 


where  $^(1)  is  the  output  of  the  n*^  slope  sensor  at  time  t,  is  the  subaperture  weighting  function 
of  the  n'^  slope  sensor,  77(s.t)  is  the  spatial  gradient  of  the  piston  and  tilt  removed  phase,  d  is  the 
unit  veaor  in  the  direction  of  slope  sensor  sensitivity,  and  a,(t)  is  the  slope  measurem  ent  noise  for 
the  n*'*  wavefront  slope  sensor. 

The  measurement  noise,  a,(i)  is  due  to  shot  noise  effects  in  the  wavefront  sensor  and  is 
assumed  to  be  uncorrelated  with  the  wavefront  phase.  The  noise  is  modeled  as  a  zero  mean  Gaussian 
random  variable.  The  standard  deviation  of  the  slope  measurement  noise,  a.(t)  is  defined  by 
(38:1918-1920) 


where  i;  is  a  correction  foctor  which  accounts  for  the  finite  sized  detectors  and  Np  is  the  total 
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subaperture  pboton  count.  The  value  of  i;  is  assumed  to  be  1.5.  This  value  corresponds  to  typical 
values  found  for  charge  coupled  device  technology  (38:1919). 

Integrating  the  Srst  term  in  Eq  3.6  by  parts,  yields 

^.(0  -  •  ^l7(r/)  dh  *  a„(0  (3-8) 

For  notational  simplicity,  the  bracketed  term  is  represented  by  W,'(x),  where  the  superscript  s 
represents  the  gradient  of  the  weighting  function  in  the  direction  of  sensitivity  of  the  n*^  slope  sensor. 

The  reconstruction  matrix  Mj,  maps  the  measured  slopes  into  a  deformable  mirror  actuator 
command.  This  matrix  is  typically  found  by  using  a  least  squares  or  minimum  variance  approach 
(28:1)(24).  This  matrix  is  the  essential  element  of  the  artificial  neural  network  reconstnictor.  This 
matrix  will  determine  the  interconneaion  weights  of  both  realizations  of  the  artificial  neural  network 
reconstructors.  The  actuator  command,  Cj(t)  is  related  to  the  slope  sensor  measurements  by 

<^,(0  W>.^,(0  (3.9) 

Equation  3.9  is  the  fundamenul  design  equation  used  to  develop  the  artificial  neural  network 
reconstructors.  Equation  3.9  determines  bow  the  measured  wavefront  slopes  are  used  to  produce  the 
deformable  mirror  aauator  commands. 

The  instantaneous  phase  correction  associated  with  the  aauator  command  Cj(t)  is  defined  by 

l](y)  -  c.(f)e.(jc)  (3-10) 

where  ej(x)  is  the  aauator  influence  funaion  or  aauator  impulse  response  funaion  to  a  unit  actuator 
command.  Here  and  in  the  simulation  Gaussian  aauator  influence  funaions  are  used.  The  aauator 
influence  at  any  point  g  in  the  aperture  plane  due  to  the  i"*  aaor  is  given  by 
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(3.11) 


«.(/)  .  e  I 

where  ^  is  the  location  of  the  aauator  center,  and  aj  is  the  nns  width  of  the  actuator  influence 
function. 

The  sum  over  ail  the  actuators  of  Eq  3.10  defines  the  estimated  phase,  7’(x,t)  of  the  incoming 
wavefront 


r(i<0  -  Et  ^j(0  «;(£) 

-  i;  E.  ^-(0 


(3.12) 


3.22  Least  Squares  Reconstructors.  The  reconstruction  matrix  (M^  allows  the  calculation 
of  the  deformable  mirror  aauator  commands  using  the  measured  slopes  (Eq  3.9).  The 
implementation  of  the  conventional  and  artificial  neural  network  reconstruaors  hinges  on  the 
definition  of  the  reconstruaion  (M^)  matrix.  A  conventional  technique  that  defines  the 
reconstruaion  matrix  is  the  least  squares  approach. 

llie  least  squares  or  closed  loop  reconstruaor  uses  feedback  control  of  the  aauator  positions 
to  compensate  for  the  aberrated  wavefront.  The  phase  estimate,  7’(x,t)  is  used  to  calculate  an 
estimated  slope  measurement  using  Eq  3.8.  The  difference  or  error  between  the  measured  slopes 
(s„(t))  and  the  estimated  slopes  is  defined  as  s„’: 

-  r,(0  -  -»C(£)  [c  (0«  (E)  1 


Since  the  aauator  command  is  independent  of  x,  the  slope  error  can  be  written  as 
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-  ^,(0  -  c  (t)  (£)«  (£) 


(3.14) 


The  integral  can  be  represented  by  a  (J  x  N)  matrix,  P*..  that  maps  the  actuator  command  vector  into 
subapenure  space.  The  difference  between  the  measured  and  estimated  slopes  is  now 

K(0  c.(t)P^  (3-15) 


To  calculate  the  least  squares  reconstructor  control  matrix,  the  mean  squared  difference  between  the 
measured  and  the  estimated  slopes  is  minimized  with  respect  to  the  actuator  command  vector. 
Minimizing  the  expansion  of  s,*’’  s,  results  in  the  least  squares  reconstructor  control  matrix 

Equations  3.14  through  3.16  define  the  terms  of  the  least  squares  reconstruction  matrix.  The  least 
squares  reconstruction  matrix  is  a  funaion  of  only  the  actuator  influence  function  and  the  directional 
gradient  of  the  subaperture  weighting  function. 

3.3  Reconstructor  Model  Development 

This  section  satisfies  research  step  3,  development  of  reconstructor  models.  The  models 
developed  in  this  section  are  implemented  as  artificial  neural  network  reconstructors.  To  accomplish 
the  development,  Eq  (3.9)  is  manipulated  to  forms  easily  implemented  using  recurrent  and 
feedforward  artificial  neural  networks.  This  section  begins  with  the  development  of  the  recurrent 
reconstructor  model  and  concludes  with  the  feedforward  reconstruaor  model. 

3.3.1  Recurrent  Reconstructor  Model  Development  This  section  uses  equation  3.9  to  develop 
a  recurrent  reconstructor  model  This  recurrent  algorithm  will  then  be  mapped  into  Hopfield  neural 
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network  state  space.  Only  after  the  algorithm  is  mapped  into  HopSeld  sute  space  can  the  name 
Hopfield  neural  network  reconstructor  be  accurately  applied. 

The  algorithm  developed  in  this  section  forms  the  basis  of  the  Hopfield  neural  network  used 
in  this  thesis  research.  Equation  3.9  is  manipulated  to  a  form  easily  implemented  using  an  recurrent 
artificial  neural  network.  The  feedback  control  algorithm  development  begins  with  the  fundamental 
equation  for  the  command  signal  to  the  deformable  mirror  actuator  (Eq  3.9). 

-  L.  [( <  (0)  ♦  ( (0)1 


where  the  contributions  to  Cj(t)  by  the  x  and  y  directed  wavefront  sensor  measurements  are  explicitly 
separated  into  two  terms.  Based  upon  this  actuator  command  signal  vector,  the  difference  between 
the  any  k"*  actuator  commands  is 


^(0  -  c*(o  -  E.  ^".(0 

♦  (A<,  -  A#;,)/,(oi 


(3.18) 


Defining 


(3.19) 


the  difference  in  Eq  3.18  is  rewritten  as 


C>(0  -  Ct(0  ♦  E  ^(0  ♦  ^l^i-k,n  i!i(01 


(3.20) 


Using  Eq  3.20  twice,  the  j**  actuator  command  can  be  written  in  terms  of  the  k"*  and  the  k’  actuat()p< 
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*  (Ai//.*.,  ♦  A^/.^^)  S^CO] 


(3.21) 


Equation  3.21  is  the  recurrent  reconstructor  model  Equation  3.21  will  be  implemented  using  a 
HopGeld  artificial  neural  network.  The  difference  in  actuator  commands  between  all  the  deformable 
mirror  actuators  and  their  neighbors  defines  a  set  of  difference  equations.  These  difference  equations 
are  used  to  establish  interdependency  of  all  deformable  mirror  actuator  commands.  The  number  of 
neighboring  actuator  conunands  used  (i.e.  how  many  feedback  interconnections),  is  determined  by  the 
Hopfield  neural  network  feedback  interconnection  weight  matrix  stability  criteria  (sea.  2.6.3). 

The  feedback  interconneaion  matrix  is  defined  by  connecting  a  single  aauator  with  only  two 
of  its  neighbors.  The  interconneaion  weight  of  1/2  from  the  to  the  k^  aauator  is  equal  to  the 
interconneaion  weight  from  the  k"*  to  the  j"*  aauator.  The  self  feedback  weight  for  the  actuator 
is  zero.  Knowing  that  two  nearest  neighbor  aauator  feedback  commands  are  needed,  the  aauator 
commands  are  now  linked  by  'connecting  the  dots”  in  the  deformable  mirror  plane  (Fig.  3.1).  A  single 
continuous  line  defines  the  aauator  feedback  connections.  By  using  a  single  continuous  line,  the 
dependence  of  each  aauator  command  upon  all  of  the  other  deformable  mirror  aauator  commands 
is  defined.  This  dependence  is  insured  through  a  cascading  effea  of  aauator  command  dependence. 
An  example  of  this  'cascading’  effea  is  shown  for  aauator  1  whose  command  depends  upon  aauators 
2  and  6.  Aauator  6  also  depends  on  aauator  12’s  command,  and  so  on  until  each  aauator  command 
is  related  to  a  weighted  version  of  each  other  aauator  in  the  array. 
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3.3.2  Feedforward  Reconstructor  Model  Development  The  algorithm  used  for  the  feedforward 
reconstructor  model  is  simply  Eq  (3.9).  Subsequent  mapping  of  this  equation  into  the  single  layer 
perceptron  artificial  neural  network  sute  space  demonstrates  the  ease  of  implemenution  of  this  type 
of  feedforward  reconstructor  model.  The  significant  advantage  of  using  a  single  layer  perceptron 
artificial  neural  network  over  conventional  techniques  is  the  high  degree  of  parallelism.  This  high 
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Figure  3.1  Actuator  Feedback  Connections 


degree  of  parallelism  should  provide  significant  speed  advantages.  The  added  feature  of  artificial 
neural  network  training  should  enhance  the  performance  of  this  technique  over  conventional 
reconstructors. 


3.4  Reconstructor  Model  Mapping  into  Conventional  Artificial  Neural  Network  State  Space 

To  satisfy  research  step  4,  the  reconstniaor  models  must  be  mapped  into  conventional 
artificial  neural  network  state  spaces.  This  section  answers  the  question,  'Are  these  reconstructor 
models  artificial  neural  networks  T  The  mapping  of  the  reconstniaor  models  into  neural  network 
state  spaces  validates  the  models  as  being  special  cases  of  the  Hopfield  and  single  layer  perceptron 
anificial  neural  network.  Each  reconstniaor  (feedforward,  recurrent)  mapping  is  developed  by  first 
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Tabkn  Aauator  Feedback  Conneaions 


j 

k,k’ 

j 

k3’ 

Bi 

2,6 

12 

6,11 

2 

13 

13 

8,18 

3 

2,7 

14 

9,15 

“1 

4 

5.9 

15 

14,19 

5 

4,10 

16 

10,11 

6 

1.12 

17 

18,21 

7 

33 

18 

13,17 

8 

7.13 

19 

1530 

9 

4,14 

20 

1931 

10 

5,16 

21 

1730 

11 

12,16 

presenting  the  conventional  neural  network  state  spaces.  The  reconstructor  models  are  then  mapped 
into  the  conventional  neural  network  state  spaces.  First,  the  feedforward  model  is  mapped  into  single 
layer  perceptron  state  space.  Finally,  the  recurrent  reoonstructor  model  is  mapped  into  Hopfield 
neural  network  sute  space. 

3.4.1  Single  Layer  Perceptron.  The  general  equation  for  a  single  layer  perceptron  artiliciai 
neural  network  can  written  as  (27:52) 

a  *  ly  7 

where  9  is  a  vector  of  activations  of  the  perceptrons,  W  is  the  weight  matrix  composed  of  each 
individual  vector  of  perceptron  weights,  and  !  is  the  veaor  of  inputs.  When  a  represents  the  outputs 
of  all  the  perceptrons,  the  single  layer  perceptron  artificial  neural  network  performs  a  mapping  of  the 
input  vector  into  a  set  of  output  veaors. 


3.4.2  The  Feedforward  Reconstructor  Model  and  the  Single  Layer  Perceptron.  The  feedforward 
recoastructor  model  is  simply  equation  3.9.  Comparing  equations  3.9  and  3.19  depicts  a  relatively 
simple  mapping.  For  linear  outputs,  a  corresponds  to  the  vector  of  aauator  commands,  W 
corresponds  to  the  reconstructor  matrix  M^,  and  I  corresponds  to  the  slope  sensor  inputs  S„.  For 
sigmoidal  activation  functions,  there  must  be  an  'inverse  sigmoid*  applied  to  each  neuronal  output. 
Sigmoid  activation  functions  are  used  in  this  thesis  to  allow  easier  training  of  the  single  layer 
perceptron  artificial  neural  network  reconstructor. 

3.4.3  The  Hopfield  Algorithm.  This  section  defines  the  conventional  Hopfield  state  space 
equations.  For  a  continuous  work  function,  Hopfield  defines  his  neural  networks  equation  of  state 
as  (18:3089:3090) 


C  m  ^  T  V  —  *  I 
‘It  ^  >  R,  ‘ 


(3.23) 


where  C,  is  the  input  capacitance  to  a  neuron,  u,  is  the  input  voltage  to  the  i*^  neuron, 

T,j  is  the  interconnection  conductance  (i.e.  weight)  from  i*  neuron  to  j*  neuron,  Vj  is  the  output  of 
neuron,  R,  »  1/T,  is  the  input  resistance  to  the  i'*’  neuron,  and  I,  is  the  input  current. 

Upon  convergence,  or  steady  state  conditions,  £q  3.23  reduces  to 


u 

t 

R 


T  V  *  I 

V  J  • 


(3.24) 


3.4.4  The  Recurretu  Reconstructor  Model  and  the  Hopfield  Algorithm.  This  section  will  map 
the  recurrent  reconstructor  model  into  Hopfield  sute  space.  The  steady  state  analysis  (Eq  3.24)  is 
used  to  define  the  Hopfield  artificial  neural  network  state  space.  Equation  3.21  shows  that  any 
actuator  control  signal  is  determined  from  a  linear  combination  of  two  weighted  neighboring  actuator 
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feedback  signals  and  a  weighted  sum  of  the  slope  sensor  outputs.  Upon  manipulation  of  Eq.  3.21  the 
actuator  command  is  represented  by 


■  I.  T,  c,«)  .  i  i:.  AM,  (3.25) 


where  the  is  the  multiplicative  1/2  term  on  the  feedback  actuator  commands,  the  directional  AMj„’s 
are  combined  into  a  single  aM^,  and  the  directional  slopes  are  combined  and  represented  by  1„.  The 
time  dependence  now  reflects  the  discrete  time  synchrounous  update  nature  of  the  model.  A  given 
aauator  command  is  updated  with  the  feedback  information  from  the  previous  time.  This  results  in 
the  *t  +  r  and  *t*  dependence  of  the  actuator  commands.  The  last  term  is  a  consunt  for  any  actuator 
until  the  slope  sensor  measurement  are  updated,  therefore  this  term  can  be  represented  by  Ij.  The 
Ti,  matrix  is  composed  of  weights  equal  to  1/2  for  each  neighbor  of  actuator  j  found  in  Table  11. 
Therefore,  at  any  time  t-f  1,  a  general  equation  for  the  i***  actuator  command  due  to  feedback  from  the 
j**  neighbors  and  the  weighted  slope  sensor  outputs  is  written  as 

-/.(E,  T.c^it)  ♦/,]  (3.26) 

where  f|,  is  the  nonlinearity  or  activation  funaion.  This  thesis  uses  a  soft  limiter  with  a  unity  gain  in 
the  linear  region  of  operation.  Here  the  assumption  is  made  that  the  soft  limiters  do  not  saturate, 
therefore  the  right  side  of  Eq  3.25  is  just  the  argument  of  the  limiter.  Comparing  Eq  3.2S  to  Eq  3.24 
shows  that  they  are  virtually  identical.  For  a  linear  activation  function  to  the  i*^  neuron,  th:  input  is 
the  same  as  the  output,  and  the  equations  are  identical  except  for  the  R,  term.  The  is  the  input 
resistance  for  the  i**  neuron.  This  thesis  assumes  that  all  of  the  neurons  are  identical,  hence  R,  =  R, 
...  -  R.  For  this  case,  both  sides  of  Eq  3.24  can  be  multiplied  by  R.  The  only  effect  of  the 
multiplication  by  R  is  to  scale  the  weight  matrix  and  input  veaor  to  the  HopGeld  neural  network,  not 
affecting  the  convergence  stability  in  any  way.  With  these  assumptions,  the  adaptive  optical  image 
reconstmaion  algorithm  represents  a  special  case  of  the  Hopfieid  neural  network. 
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3.5  Summary 

This  chapter  has  gone  through  the  fuadamental  theory  used  in  the  developmeni  of  »his 
research  effort  The  reconstruaor  models  conform  to  the  Hopfield  and  the  single  layer  perceptron 
artificial  neural  network  models.  Since  the  models  conform  to  the  conventional  artificial  neural 
network  representations,  the  term  'artificial  neural  network  'econstnictor*  is  now  applicable. 
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IV.  Analysis  Results 


4.1  Introduction 

This  chapter  presents  the  results  of  the  analysis  and  simulation.  The  task  of  analyzing  and 
simulating  the  adaptive  optical  imaging  system  is  an  intricate  process.  To  adequately  address  each 
specific  part,  the  entire  simulation  process  was  divided  into  specific  experiments.  The  resulting 
experiments  are  defined  below. 

Experiment  # 

(1)  Esublish  system  comparison  baseline  (least  squares  reconstructor)  analysis 

results. 

(2)  Integrate,  analyze,  and  test  system  performance  using  artificial  neural  network 

reconstructors.  Compare  performance  of  adaptive  optical  imaging  system 
with  artificial  neural  network  reconstructois  to  baseline  system. 

(3)  Examine  the  effects  on  adaptive  optical  imaging  system  performance  by  using 

artificial  neural  network  reconstructois  with  sparsened  slope  measurement 
weights. 

(4)  Implement  training  algorithms  and  examine  effects  on  overall  system 

performance. 

Section  4.2  discusses  experiment  1.  Section  4.2  presents  the  parameters  used  and  the  results  obtained 
using  the  least  squares  reconstructor.  Experiment  2  is  discussed  in  section  4.3.  Seaion  4.3  validates 
the  performance  of  the  artificial  neural  network  reconstructors.  The  performance  validation  is 
obtained  by  comparing  the  artificial  neural  network  reconstruaor  performance  to  the  baseline 
established  in  experiment  1.  Section  4.4  addresses  experiment  3.  Section  4.4  examines  the  effects  of 
slope  measurement  weight  sparsening  on  the  artificial  neural  network  reconstruaors  performance. 
Sparsening  is  accomplished  by  'zeroing  out'  two  and  four  slope  measurement  weights  to  each  neuron. 
The  final  experiment  is  examined  in  section  4.5.  Section  4.5  examines  the  impact  of  neural  network 
training  on  system  performance. 
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4.2  Experiment  #i  •  Establishing  a  Compamon  Baseline 

The  analysis  and  simulation  began  with  the  establishment  of  the  comparison  baseline.  The 
comparison  baseline  system  uses  the  least  squares  reconstructor  linear  estimation  technique.  The 
comparison  baseline  results  are  generated  from  an  unmodified  version  of  the  simulation  code  provided 
by  the  Phillips  Laboratory,  Kirtland  AFB  NM. 

4.11  Sysum  Parameters.  The  adaptive  optical  system  parameters  used  for  the  least  squares 
and  artificial  neural  network  reconstructors  are  shown  in  Table  L  The  variations  in  light  level  were 
chosen  to  encompass  a  wide  range  of  operation.  The  range  chosen  is  consistent  with  the  light  level 
variations  examined  by  Welsh  (38).  The  changes  in  atmospheric  coherence  diameter  were  based  upon 
the  many  factors.  The  range  of  coherence  diameter  variations  coincides  with  the  range  presented  by 
Goodman  (13).  The  aaual  values  for  the  coherence  diameters  were  chosen  to  examine  the  effects  of 
partial  coherence  and  small  aperture  diameter  to  r,  ratios. 


Ligta  Level 

Figure  4.1  BaseiitH!  Adaptive  Optical  System  Results  -  Least  Squares  Reconstructor 
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4.12  Baseline  System  Results.  The  results  of  the  analysis  of  the  baseline  adaptive  optical 
system  (least  squares  reconstructor)  is  shown  in  Figure  4.1.  The  baseline  or  least  squares 
reconstructor  uses  eq  3.9  to  calculate  the  deformable  mirror  actuator  commands.  The  response  of  the 
linear  estimator  (Fig  4.1)  is  consistent  with  the  predicted  system  sensitivity.  The  esublishment  of  this 
system  baseline  can  now  be  contrasted  to  the  performance  obtained  using  artificial  neural  network 
reconstmaors.  The  results  for  this  comparison  are  presented  in  subsequent  sections. 

4.3  Experiment  02  •  Artificial  Neural  Network  Reconstructor  Results 

This  section  presents  the  analysis  results  for  both  artificial  neural  network  reconstructors. 
The  single  layer  perceptron  results  are  presented  first,  followed  by  the  analysis  results  for  the  Hopfield 
artificial  neural  network  reconstmaor. 

4.3.1  Sin^  Layer  Perceptron  Art^ial  Neural  Network  Reconstructor.  The  results  of  section 

3.4.2  showed  that  the  single  layer  perceptron  artificial  neural  network  reconstructor  mapped  into  the 
exaa  same  form  as  the  least  squares  reconstructor.  Therefore,  the  results  shown  in  Figure  3.1  also 
represent  the  results  obtained  using  a  single  layer  perceptron  artificial  neural  network  reconstructor. 

4.3.2  Hopfield  Artificial  Neural  Network  Reconsiructor.  The  recurrent  artificial  neuial 
network  reconstructor  developed  in  seaion  3.4.4  was  integrated  into  the  adaptive  optical  system 
simulation  code.  The  Hopfield  neural  network  is  represented  by  a  discrete  time,  synchronously 
updated  system.  The  disaete  time  nature  implies  that  the  time  intervals  used  to  perform  calculations 
and  updates  were  discrete.  The  feedback  values  were  synchronously  updated  to  each  Hopfield  neuron. 
The  Hopfield  artificial  neural  network  reconstructor  was  subjected  to  the  exact  same  parametric 
variations  as  the  baseline  least  squares  reconstmaor. 

The  results  for  the  Hopfield  neural  network  are  grouped  according  to  the  atmospheric 
coherence  diameter  used.  The  first  case  used  an  atmospheric  coherence  diameter  of  7.5  cm  (Case  1). 
The  general  form  of  the  reconstmaor  followed  the  baseline  reconstmaors  form.  The  Hopfield 
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UfU  Lgwl  (pkotcjcoBt>>ViBb>a/h»t  toe) 
Figure  4.2  Experiment  2  -  Case  1  Results 


Figure  4J  Experiment  2  -  Case  1  Results,  Effects  of  Bias  Compensation 

artificial  neural  network  reconstruaor  resulted  in  a  bias  ofBct  to  each  deformable  mirror  actuator 
command.  The  bias  offset  represents  a  piston  or  constant  phase  error  to  the  reconstruaed  phase. 
The  presence  of  a  piston  error  to  an  adaptive  optical  imaging  system  is  of  no  consequence.  The  effect 
of  removing  the  piston  enor  is  shown  in  Figure  4.3.  Figure  4.3  shows  the  difference  between  the  least 
squares  reconstruaor  and  the  Hopfield  neural  network  reconstruaor  with  and  without  the  piston 
error.  A  consunt  piston  error  was  removed  firom  each  aauator  after  the  Hopfield  neural  network 
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Figure  4.4  Experiment  2  •  Case  2  Results 


U^Laerf  (photocoaci^iab^^  t&M) 

Figure  4.5  Experiment  2  •  Case  2  Results,  Effects  of  Bias  Compensation 

converged.  Fig  4.3  shows  that  the  average  difference  between  the  HopGeld  artificial  neural  network 
reconstnurtor  and  the  least  squares  reconstructor  when  the  bias  is  removed  is  quite  small.  Therefore, 
the  Hopfield  artificial  neural  network  reconstructor  with  bias  removal  obtains  performance 
commensurate  with  the  least  squares  and  the  single  layer  perceptron  reconstnictor.  Figs.  4.4  through 
4.7  show  similar  results  for  different  atmospheric  coherence  diameters.  Case  2  coincides  with  an  r„ 
of  10  cm  and  Case  3  coincides  with  an  r^  of  20  cm. 
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Figure  4.6  Experiinent  2  •  Case  3  Results 
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LigkLeool  (phaMGeoDti^Wbq^ tkw) 

Figure  4.7  Experiment  2  -  Case  3  Results,  Effects  of  Bias  Compensation 

Although  a  constant  piston  error  is  insigniHcant  in  an  adaptive  optical  imaging  system,  the 
origin  of  the  error  must  be  analyzed.  The  origin  of  the  piston  error  was  studied  by  examining  all  of 
the  actuator  values  upon  network  convergence.  This  analysis  found  that  each  aauator  command  was 
ofket  from  the  least  squares  reconstruaor  actuator  command  by  a  constant  value.  The  origin  of  the 
bias  o^t  can  have  two  possible  otplanations.  First,  the  energy  function  of  the  Hopfield  artificial 
neural  network  reconstnictor  is  examined  in  Appendix  A.  The  results  of  analysis  show  that  the  size 
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neural  network  reconstructor  is  examined  in  Appendix  A.  The  results  of  analysis  show  that  the  size 
of  the  steps  taken  toward  a  energy  minimum  becomes  increasingly  smaller  as  the  minimum  is 
approached.  This  effect  results  in  a  smaU  finite  error  between  the  aaual  actuator  command  and  the 
aauator  command  the  network  converges  to.  Subsequent  sections  (Sect  4.3.2)  show  that  a  change 
in  system  inputs  to  a  Hopfield  neural  network  may  result  in  large  changes  in  the  system  output.  The 
effect  of  the  convergence  properties  coupled  with  the  recurrent  nature  may  result  in  deviations  in  the 
system  output  from  desired  outputs.  The  second  cause  may  be  the  lack  of  a  constant  bias 
compensation  term  to  each  Hopfield  neuron.  The  effects  of  a  constant  bias  compensation  term  have 
been  explored  in  response  to  the  presence  of  multiple  minimas  (30).  This  technique  of  minima 
compensation  is  known  as  "simulated  annealing.*.  The  technique  of  simulated  annealing  has  been 
shown  to  produce  a  single  global  minimum.  Simulated  annealing  is  implemented  in  the  Hopfield 
artificial  neural  network  by  introducing  a  bias  current  to  each  Hopfield  neuron.  A  similar  technique 
could  be  used  to  reduce  the  bias  in  the  networks  examined  for  this  thesis. 

4.4  Experiment  03  ■  Examination  of  the  Effects  ofSparsening 

This  section  examines  the  effects  of  reducing  the  number  of  slope  measurements  available  to 
each  artificial  neural  network  neuron  or  "sparsening.*  Each  neuron  must  now  calculate  an  actuator 
command  using  less  information.  The  intent  of  sparsening  is  to  reduce  the  number  of 
interconnections  required.  This  reduaion  in  interconnections  should  equate  to  an  overall  reduction 
is  system  complexity  and  compuutional  frame  time. 

The  method  used  to  determine  which  slope  inputs  to  neglect  was  made  by  examining  the 
weights  on  the  slope  measurements  to  the  Hopfield  artificial  neural  network.  This  method  neglected 
specific  slope  sensor  measurements  by  zeroing  out  the  least  significant  weights  to  each  neuron.  Table 
111  shows  which  slope  sensor  subaperture  measurements  were  negleaed.  Table  111  shows  which  2 
(S.(2))  and  4  (Sa(4))  slope  sensor  subaperture  measurements  that  were  neglected  for  each  actuator. 
For  example,  for  aauator  1,  the  least  significant  weights  were  those  conneaing  slope  measurements 
4  and  8  to  the  1st  neuron.  The  four  least  significant  weights  were  the  weights  conneaing  the  2,  3,  4, 


Table  in  Spaisened  Slope  Measurement  Data 


Aauator  # 

S,  (2) 

S,(4) 

Actuator  # 

S„(2) 

S„  (4) 

1 

4,18 

23,4,18 

12 

18,19 

5,4,19,20 

2 

4,18 

4,12,16,18 

13 

15,18 

8,12,15,18 

3 

12,18 

4,12,16,18 

14 

2,7 

13,6,7 

4 

U 

13,4.13 

15 

2,6 

133.6 

5 

4,18 

23.4,18 

16 

15,19 

15,16,18,19  1 

6 

3,18 

3,4,8,18 

17 

5,15 

5,10,15,18 

7 

12,18 

8,12,16,18 

18 

15,18 

5,10,15,18 

8 

12,18 

8,12,15,18 

19 

5,11 

13.6,11 

9 

2,11 

2,6,7,11 

20 

5,7 

5,6,7,11 

10 

10,18 

23.10,18 

21 

5,15 

5,10,11,15 

11 

18,19 

15,18,1930 

and  18th  slope  measurements  to  the  1st  neuron.  The  same  slope  measurements  that  were  negleaed 
for  the  Hopfield  artificial  neural  network  reconstructor  were  also  neglected  for  the  single  layer 
perceptron  reconstructor. 

4.4  1  f  ryr'  Perceptron.  Th?  rff?cts  of  sparsening  is  examined  for  the  single  layer 

perception.  The  weights  were  set  to  zero  for  the  specific  sparsening  case.  The  zero  weights 
corresponds  to  zeroing  out  the  appropriate  matrix  coefficients.  The  results  of  the  sparsening  of 
the  single  layer  perception  network  is  shown  in  Figure  4.8.  Figure  4.8  shows  that  the  performance 
characteristics  of  the  spaisened  single  layer  perceptron  artificial  neural  network  are  degraded. 

4.4.2  Hoiiield  The  results  for  the  spaisened  Hopfield  artificial  neural  network  is  shown  in 
Figure  4.9.  For  the  Hopfield  artifidai  neural  network,  the  sparsening  did  not  correspond  to  zero  out 
the  coefficients.  The  determination  of  the  input  weights  is  attributed  to  the  lack  of  one  to  one 
correspondence  between  input  weights  to  the  Hopfield  artificial  neural  networks  and  the  matrix 
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Figure  4^  Experiment  3  -  Sparsened  single  layer  perceptron  results 


Figure  4.f  Experiment  3  -  Sparsened  HopHeld  artificial  neural  network  results 


coefficients.  The  sparsening  of  the  Hopfield  artificial  neural  network  reconstruaor  zeroes  out  the 
AMjB  coefficients  (Eq  3.21).  These  zeroed  out  coefficients  are  combinations  of  the  differences  in  the 
Mjg  matrix  and  a  shifted  version  of  the  matrix.  The  aaual  effects  of  zeroing  out  the  M,„ 
coefficients  (single  layer  perceptron  sparsening)  and  producing  the  matrix  was  examined.  For 
example,  in  some  cases  zeroing  out  4  coefficients  and  then  producing  the  aM^  matrix  resulted 
in  the  zeroing  out  of  only  2  aM^  coefficients. 
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4.5  Experiment  04  -  Training  Results 

The  input  weights  to  the  single  layer  perceptron  and  the  Hopfield  artificial  neural  network 
reconstructors  were  trained.  The  goal  of  training  was  to  determine  if  artificial  neural  network  training 
could  compensate  for  the  lack  of  information  presented  by  sparsening.  The  algorithms  used  to  train 
the  artificial  neural  network  reconstruaors  are  found  in  Appendices  B  and  C. 

4.5.1  Sin^  Layer  Perceptron  Artificial  Neural  Network  Reconstructor.  The  training  results  of 
the  single  layer  perceptron  artificial  neural  network  reconstructor  are  depiaed  in  Figure  4.10.  The 


Figure  4.10  Experiment  4  •  Single  Layer  Perceptron  Training  Results 

single  layer  perceptron  was  trained  using  a  training  and  test  set  composed  of  400  training  pairs  per 
epoch.  The  light  level  was  100  photocounts/^ubaperture/integration  time,  and  the  atmospheric 
coherence  diameter  was  10  cm.  Figure  4.10  shows  that  the  performance  of  the  sparsened  artificial 
neural  network  reconstructor  was  improved  by  training.  At  light  level  above  approximately  7 
photocounts/subaperture/integration  time,  the  trained  artificial  neural  network  provided  better 
compensation  for  wavefront  aberrations.  The  lack  of  performance  of  the  trained  artificial  neural 
network  reconstructor  at  low  light  levels  is  explained  by  the  ability  of  the  net  to  learn  the  probability 
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distribution  of  tbe  noise.  At  high  light  lewis,  the  noise  variance  is  smaller  than  the  noise  variance 
at  low  light  levels.  Hence,  training  at  high  light  levels  would  teach  the  network  the  noise  variance 
properties  using  small  variances.  You  could  not  expect  the  network  to  provide  accurate  estimates 
when  the  noise  variance  gets  markedly  larger  than  from  what  the  network  learned. 

4.6  Summary 

This  chapter  presented  the  analysis  and  simulation  results.  Section  4.3  showed  that  the 
artificial  neural  network  reconstructors  could  achieve  performance  commensurate  with  conventional 
linear  estimation  techniques.  In  an  effort  to  reduce  the  number  of  hardware  interconnections,  the 
effects  of  sparsening  were  examined  in  section  4.4.  The  result  of  section  4.4  was  that  the  sparsened 
recurrent  anificial  neural  network  reconstructor  experienced  a  larger  performance  degradation.  The 
goal  of  section  4.5  was  to  find  out  if  artificial  neural  network  training  principles  could  be  applied  to 
compensate  for  input  sparsening.  Section  4.5  showed  that  a  reduction  in  hardware  interconneaions 
could  be  coupled  with  backpropagation  training  principles.  The  result  was  performance  enhancements 
not  currently  achievable  using  conventional  linear  estimation  techniques. 
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V.  Conclusions  and  Recommendations 


5.1.  Introduaion 

This  thesis  researched  the  implementation  of  baseline  artificial  neural  network  reconstruaors 
with  an  adaptive  optical  imaging  system.  This  research  examined  the  performance  of  the  single  layer 
perceptron  and  the  Hopfield  artificial  neural  network  reconstnictors.  The  reconstruaors  were 
integrated  with  an  adaptive  optical  imaging  system  that  was  composed  of  a  wavefiont  sensor  with  20 
slope  sensor  subapertures  and  a  21  individually  controlled  aauator  deformable  minor. 

The  entire  process  was  written  in  Fortran  77  and  simulated  on  a  VAX  VMS  system.  The 
main  simulation  code  was  provided  by  the  Phillips  Laboratory,  Kirtland  AFB  NM.  The  software 
written  for  this  thesis  consisted  of  the  code  to  calculate  the  Hopfield  input  weights,  the  artificial 
neural  network  reconstruaor  subroutines,  and  the  artificial  neural  network  training  algorithms. 

This  chapter  presents  a  brief  summary  of  what  was  accomplished  by  this  research  effort  This 
summary  of  accomplishments  is  followed  by  a  summary  of  recommendations  for  future  research  into 
this  area. 

5.2.  Conclusions 

1.  The  artificial  neural  network  reconstruaor  is  a  feasible  alternative  to  conventional  linear 
estimation  techniques.  The  artificial  neural  network  is  an  extremely  fast  densely  packed, 
highly  parallel  distributed  processing  struaure  composed  of  many  simple  processing  elements. 
These  features  alone  merit  consideration  of  the  artificial  neural  network  as  an  alternative  to 
conventional  linear  estimators.  This  research  also  demonstrated  that  there  is  virtually  no 
performance  difference  between  artificial  neural  networks  and  the  conventional  least  squares 
reconstruaor. 

Z  The  added  feature  of  artificial  neural  network  training  gives  the  artificial  neural  network 
the  ability  to  provide  performance  enhancements  not  currently  achievable  using  conventional 
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linear  estimators.  This  performance  enhancement  was  demonstrated  by  the  trained  single 
layer  perceptron  using  only  16  of  20  slope  sensor  inputs. 

3.  The  recurrent  artificial  neural  network  can  provide  performance  commensurate  with 
conventional  linear  estimators  and  single  layer  perceptrons.  The  feedback  nature  of  recurrent 
neural  networks  did  not  result  in  the  system  robustness  originally  envisioned.  Instead,  the 
recurrent  nature  may  result  in  inherent  instabilities  and  a  reduction  in  the  systems  ability  to 
compensate  for  the  lack  of  essential  information. 

5.3  Recommendations 

This  research  represented  an  initial  effort  at  AFIT  to  integrate  two  diverse 
technologies;  adaptive  optics  and  artificial  neural  networks.  A  general  recommendation  is 
that  the  entire  simulation  should  be  moved  to  a  workstation  environment.  Throughout  this 
effort,  the  mainframe  computer  reliability  was  quite  unpredictable.  The  only  pitfall  is  that 
a  workstation  environment  would  require  rewrite  of  the  IMSL  subroutines  used  by  the 
simulation.  A  residual  effea  of  this  groundbreaking  effort  was  many  simplifications  were 
made  to  complete  the  research  in  the  allotted  time.  Therefore,  the  recommendations  for 
future  research  are  many  and  grouped  into  two  basic  areas;  adaptive  optical  imaging  systems, 
and  artificial  neural  networks. 

5.3.1  Adaptive  Optical  Imaging  System.  This  research  was  based  upon  a  nominal 
amount  of  wavefront  slope  sensors  and  deformable  mirror  actuators.  A  logical  extension  is 
to  increase  the  amount  of  slope  sensors  and  deformable  mirror  actuators.  By  increasing  the 
both  the  amount  of  slope  sensor  subapertures  and  deformable  mirror  actuators,  the  effects 
of  reducing  the  number  of  slope  sensor  inputs  or  'sparsening”  can  be  further  researched. 
Coupled  with  the  increase  in  the  slope  sensor  subapertures  and  deformable  mirror  actuators 
should  be  an  increase  in  the  mirror  diameter. 
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Further  research  should  eramine  different  techniques  to  reduce  the  amount  of  slope 
information  used  by  each  neuron.  This  research  used  the  weights  of  each  slope  sensor 
measurement  to  determine  the  importance  of  each  slope  m'^asurement  to  a  particular  neuron. 
Spatial  location  should  be  explored  as  means  of  reducing  the  amount  of  slope  measurements 
input  to  each  neuron. 

The  mirror  surface  may  not  be  flat  when  actuator  voltages  are  set  to  zero.  Additional 
mirror  data  is  needed  to  model  these  effects.  A  bias  may  be  added  to  selected  deformable 
minor  actuator  commands  to  compensate  for  this  effect  The  effect  of  distina  actuator 
influence  funaions  can  be  modeled. 

The  temporal  aspects  of  the  system  also  need  to  be  researched.  An  instanuneous 
correaion  was  assumed  for  this  thesis.  The  effects  of  finite  and  random  time  delays  between 
actuator  command  and  movement  need  to  be  addressed.  The  temporal  aspects  of  the 
waveffont  sensing  process  need  to  be  analyzed.  The  Hartmann  sensor  can  undergo  slow 
transients  when  detecting  time  invariant  phase  distortions.  These  slow  transients  are 
attributed  to  the  sensor  Vanning  up*.  These  effects  can  '«  modeled  as  a  random  bias  for 
each  slope  channel. 

A  hardware  system  composed  of  a  continuous  facesheet  deformable  mirror  and  a 
Hartmann  wavefront  sensor  is  available  for  use  to  provide  verification  of  the  software 
simulation  results.  Operational  testing  and  evaluation  of  this  system  should  provide 
additional  data  that  can  be  integrated  with  the  software  simulation. 

5.3.2  Artificial  Seural  Networks.  The  baseline  anificial  neural  networks  studied  in 
represented  two  representative  networks  for  two  classes;  recurrent  and  feedforward. 
Additional  research  into  additional  types  of  artificial  neural  networks  is  highly  recommended. 
Many  fundamental  questions  still  remain  unanswered.  For  instance,  historically  there  has 
been  two  types  of  linear  reconstructors;  minimum  variance  and  least  squares.  Can  an 
artificial  neural  network  somehow  be  trained  to  provide  performance  commensurate  with  the 
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minimum  variance  but  have  the  robustness  of  the  least  squares  reconstructor  ?  Can  a  neural 
network  compensate  for  idiosyncrasies  of  the  system  components  through  training.  Research 
into  multilayer  perceptron  type  networks  is  essential  Recently,  a  recurrent  backpropagation 
network  was  demonstrated  at  AFTT  with  great  success.  The  possibility  of  using  this  type  of 
network  to  perform  wavefront  reconstruction  is  encouraged.  This  thesis  outlined  the 
fundamental  implementation  of  the  artificial  neural  network  into  the  adaptive  optical  imaging 
system.  The  fundamenul  goal  of  future  research  should  be  geared  to  find  the  best  candidate 
neural  networks  for  implementation  into  the  adaptive  optical  imaging  system  environment. 

The  artificial  neural  networks  used  in  this  thesis  effort  did  not  use  a  bias  term  into 
each  neuronal  element.  This  bias  term  should  be  added  to  each  neuron  of  both  the  recurrent 
and  the  Hopfield  artificial  neural  networks.  The  effects  of  this  bias  on  the  weight  correlation 
to  known  reconstructor  matrix  coefficients  should  be  examined.  For  classifiers,  the  bias 
allows  partitioning  of  the  feature  space  by  offsetting  the  hyperplane  locations.  For  funaion 
estimation,  the  effea  of  this  bias  term  must  be  analyzed  in  depth.  The  effea  of  the  bias  on 
training  should  also  be  addressed.  Training  the  network  without  bias  is  analogous  to  'tieing 
one  hand  behind  the  networks  back”.  The  bias  adds  an  additional  degree  of  freedom  to  the 
training  algorithms.  This  added  degree  of  freedom  and  its  effect  on  training  time  and 
learning  rates  should  be  analyzed.  Analysis  into  the  use  of  backpropagation  with  momentum 
and  conjugate  gradient  training  techniques  is  encouraged. 

Software  simulation  is  meant  to  be  a  predictor  of  aaual  haidware  implementations. 
A  hardware  implemenution  is  now  possible  and  encouraged  here  at  AFTT.  Neural  network 
integrated  circuits  are  currently  available,  coupled  with  the  deformable  mirror  and  Hartmann 
wavefront  sensor  available  at  AFTT,  increased  research  into  the  spatial  and  temporal  aspects 
is  essential.  Concurrence  of  hardware  analysis  and  testing  with  software  simulation  is 
encouraged. 
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Appendix  A:  Hopfield  Artificial  Neural  Network  Ener^  Function  Analysis 


A I  Introduction 

The  convergence  of  this  type  of  neural  network  is  essential  to  accurate  wavefront  phase 
estimates.  The  energy  function  for  a  Hopfield  neural  network  describes  the  network  convergence 
properties.  Close  examination  of  the  gradient  of  the  energy  funaion  defines  the  convergence 
properties  for  this  type  of  neural  network.  The  energy  function  of  the  Hopfield  neural  network  with 
a  continuous  work  funaion  has  unique  properties.  The  energy  funaion  of  the  Hopfield  neural 
network  with  a  continuous,  linear  aaivation  funaion  is  described  by  Eq  A.1  (18:3089-3090).  Using 
the  same  notation  as  seaion  3.4.3,  for  a  unity  gain,  the  inverse  I/O  relationship  reduces  to  simply  V„ 
which  gives  us 

£  *  -  i  y'  T.W. 

2  ^-1  Z-J.l  y  I  I 
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Examination  of  the  gradient  of  the  energy  funaion  with  respect  to  the  input  voltage  V;,  yields 
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Analyzing  this  definition  of  the  change  in  the  energy  with  respect  to  a  change  in  the  neuronal  input 
yields  interesting  results.  For  identical  neurons  with  a  constant  input  resistance  R,  the  right  hand  side 
of  the  gradient  equation  results  in  the  three  cases  shown  in  Eq  (A.3).  The  first  case  results  in  a 
nej^jtive  gradient.  The  second  case  occurs  at  convergence  for  the  energy  surface  or  a  minimum.  The 
third  case  results  in  a  positive  gradient.  For  each  case,  the  magnitude  of  the  gradient  is  equivalent 
to  the  difference  between  the  left  hand  side  and  the  right  hand  side  of  each  case. 
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(1)  Iv  *  I  <  ly^  T.  V 

(2)  1  V  *  I  ^  1  y  T.  V 

Q)  Iv  *  I  >  ly  TV 

The  form  of  the  energy  surface  is  found  by  applying  the  second  derivative  test  to  the  energy  gradient. 
The  second  derivative  of  the  energy  function  yields  2/R,.  Since  the  resistance  is  always  positive  e 
energy  function  is  concave  upwards.  The  critical  values  for  the  second  derivative  test  are  found  by 
solving  for  the  energy  gradient  of  zero  ,  Case  2 .  The  second  derivative  test  reveals  that  these  critical 
values  represent  minimums  on  the  energy  surface.  Figure  3.1  is  a  graphical  representation  of  the 


Figure  A.1  Energy  Surface  Representation 


energy  surface  outlined  by  the  previous  analysis. 

The  energy  function  provides  a  gradient  descent  down  the  energy  surface  toward  a  minimutn. 
The  magnitude  of  the  gradient  determines  the  size  of  the  steps  taken  toward  a  minimum.  Therefore, 
as  the  difference  in  the  weighted  output  versus  the  weighted  inputs  decreases  (Eq  A.3  cases  1,3),  the 
movement  down  the  energy  surface  decreases  accordingly.  The  Hopfield  network  step  size  is  also 
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decreasing  until  a  minima  is  reached.  This  step  size  reduction  results  in  a  slowing  down  of  the 
movement  down  the  energy  surface  as  the  minima  is  approached. 

A  useful  contrast  is  made  by  examining  the  associative  memory  application  of  the  Hopfleld 
neural  network.  The  step  size  of  an  associative  memory  is  based  upon  the  relationship  of  the  energy 
gradient  to  a  threshold  value.  The  hard  limiter  used  in  the  associative  memory  Hopfield  neural 
network  results  in  a  discrete  step  size.  The  step  size  of  the  associative  memory  Hopfield  neural 
network  has  values  of  0,1,2.  The  discrete  step  size  results  in  extremely  fast  convergence  toward  a 
minimum.  The  Hopfield  network  studied  for  this  thesis  has  continuous  outputs,  this  results  in  a 
larger  variation  in  the  step  sizes  taken  toward  an  energy  minimum.  The  variations  in  step  sizes 
coupled  with  the  decrease  in  step  sizes  as  the  minimum  is  approached  has  resulted  in  an  increase  in 
the  number  of  iterations  taken  to  converge.  The  problem  with  this  type  of  minimum  seeking  algorithm 
is  the  presence  of  local  minimums. 

A.  2  Uniqueness  of  Hopfield  Network  Solution.  This  type  of  neural  network  reconstnictor  usefulness 
is  based  upon  the  convergence  to  an  accurate  phase  estimate.  Zhao  and  Mendel  (41:499-506)  proven 
that  a  neural  network  with  a  specific  form  will  converge  to  a  unique  solution  defined  by  the  initial 
conditions.  A  unique  output  is  an  output  with  a  single  or  global  minimum.  What  this  infers,  is  that 
under  specific  conditions  this  class  of  neural  netwui  k  will  calculate  a  unique  answer  for  unique  inputs. 
For  the  adaptive  optical  imaging  system,  this  equates  to  a  unique  phase  estimate  for  a  unique  set  of 
system  conditions. 

The  form  of  the  network  examined  by  Zhao  and  Mendel  corresponds  to  a  Hopfield  neural 
network,  with  specific  assumptions.  The  assumptions  employed  by  Zhao  and  Mendel  apply  to  the 
neural  network  studied  for  this  thesis  and  are  as  follows 

(1)  Feedback  interconnection  matrix  is  symmetric 

(2)  No  self  feedback  for  individual  neurons 

(3)  Initial  condition  for  the  neuronal  outputs  is  zero 

(4)  Neuron  outputs  are  bounded  within  a  well  defined  region 
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(5)  Linear  gain  within  the  output  boundaiy  region 

(6)  All  neurons  have  the  same  I/O  characteristics. 

The  lack  of  local  minimums  alleviates  the  concern  that  the  neural  network  reconstructor  may 
provides  erroneous  actuator  commands. 
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Appendix  B:  Sin^  Layer  Perceptron  Artaud  Neural  Network  Training 


B.1  Introduction 

This  appendix  examines  the  training  technique  used  for  the  single  layer  perceptron  artificial 
neural  network  reconstruaor.  The  artificial  neural  network  training  used  is  the  backward  error 
propagation  technique  or  simply  "backprop.*  This  thesis  used  the  insuntaneous  method  of 
backpropagation  training.  The  instantaneous  method  minimizes  the  error  associated  with  a  single 
training  exemplar.  Weight  updates  are  made  for  each  training  exemplar  presented  to  the  artificial 
neural  network. 

B.2  Instantaneous  Backpropagation  Method 

The  instantaneous  backprop  method  begins  by  defining  the  total  error  for  the  neuron,  Cj 

2$ 

Cj  »  (d.  -  c^.)  (B*l) 

where  dj  is  the  desired  output  and  Cj  is  the  calculated  output  for  the  j*  neuron.  The  toul  energy  (E^) 
is  defined  as  the  sum  of  the  individual  mean  squared  errors  or 

■  i;  5  H 

The  weight  updates  are  implemented  using  the  first-order  gradient  steepest  descent  (27:103.)  The 
weight  update  equation  for  the  input  interconneaion  weights  (Wy)  is  defined  by 


The  new  weight,  is  written  in  terms  of  the  old  weight,  w^',  the  energy,  and  a  learning  rate,  r;.  The 
single  layer  perceptron  network  is  depicted  in  figure  1.5.  The  task  that  remains  is  the  calculation  of 
the  weight  change,  aWi^.  Using  equation  B.2,  the  change  in  weight  becomes 
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The  calculated  output  of  the  single  layer  perception,  is  define  by 


c. 

J 


W.I, 


«>) 


where  f,,  is  the  aaivation  funaion  or  nonlinearity,  is  the  i"'  input  feature  to  the  j"  neuron,  and  6, 
is  the  bias  term.  For  this  thesis  the  activation  function  used  is  the  sigmoid  or  "squashing”  function. 
When  the  argument  of  the  activation  is  represented  by  a(W|j),  the  calculated  output  using  a  sigmoidal 
activation  function  is  now 
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The  partial  derivative  in  equation  B.4  is  now 
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Using  equations  B.7  and  B.3,  the  equation  for  the  weight  updates  becomes 

w:  -  w:  ♦  {d.  -c.)  c.  (1  -  cp  r 


(B.8) 


To  implement  this  algorithm,  the  following  steps  were  used; 


B-2 


(1)  Initialize  weights  and  thresholds  to  small  random  numbers. 

(2)  Statistically  normalize  the  input  features 

(3)  Present  training  vector  and  desired  output 

(4)  Calculate  the  actual  output  using  B.S 

(5)  Update  weight  using  B.S,  where  0  <  s  1 
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Appendix  C:  Hopfield  Artificial  Neural  Network  Training 


C.l  Introduction 

This  thesis  will  examine  if  the  conventional  Hopfield  neural  network  can  be  trained  to  provide 
performance  enhancements  under  varying  system  and  hardware  conditions.  The  algorithm  developed 
as  part  of  this  research  effort  begins  with  the  conventional  backpropagation  algorithm. 

The  conventional  Hopfield  artificial  neural  network  utilizes  the  feature  of  fixed 
interconnection  weights  to  provide  extremely  fast  convergence  to  a  minimum  on  the  energy  surface. 
The  recurrent  backpropagation  neural  network  is  a  special  case  of  the  Hopfield  neural  network.  The 
main  distinctions  between  the  Hopfield  artificial  neural  network  and  the  recurrent  backpropagation 
neural  network  are  that  the  recurrent  backpropagation  neural  network  uses  a  conventional 
backpropagation  training  algorithm  to  adapt  both  feedback  and  feed  forward  (input)  weights.  The 
recurrent  backpropagation  neural  network  is  not  designed  to  iterate  until  convergence  like  a  Hopfield 
neural  network,  but  instead  is  developed  as  a  predictor  of  future  neuron  output  states.  Due  to  this 
future  state  predictor  application,  the  recurr‘;nt  backpropagation  neural  network  also  assumes  that 
there  is  some  temporal  relationship  between  input  training  exemplars.  This  thesis  will  develop  a 
neural  network  that  diverges  from  the  conventional  Hopfield  and  recurrent  backpropagation  neural 
networks.  This  hybrid  network  will  have  fi.xed  feedback  interconnection  weights  and  will  train  the 
input  weights.  There  are  numerous  reasons  for  this  methodology.  First,  the  feedback  interconnection 
weight  matrix  is  well  defined.  Altering  of  the  feedback  interconnection  weight  matrix  may  in  fact 
jeopardize  the  stability  criterion  fundamental  to  this  type  of  recurrent  neural  network.  Another 
reason  is  that  due  to  the  sheer  magnitude  of  the  problem,  the  reduced  weight  training  set  will  equate 
to  a  reduction  in  overall  neural  network  training  time.  The  general  system  to  which  the  training 
algorithm  is  to  be  applied  is  shown  in  Fig  (Cl). 

There  are  fundamentally  two  ways  to  implement  the  backpropagation  training  algorithm,  the 
batch  and  instantaneous  methods.  The  batch  backpropagation  training  algorithm  calculates  the 
ensemble  training  exemplar  error,  using  the  sum  of  the  instantaneous  errors  (£k'’(^))  training 
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Figure  C.l  Generic  Recurrent  Neural  Network  Architeaure 

set  as  the  measure  to  minimize.  The  superscript  p  denotes  an  instantaneous  value,  and  the  subscript 
k  denotes  the  k*^  neuronal  element  The  t,  is  the  time  when  the  network  settles  or  converges  to  an 
answer.  For  this  thesis  effort  convergence  stability  is  defined  as  100  successive  iterations  where  the 
new  output  is  less  than  0.1%  different  than  the  previous  output  The  batch  backpropagation  method 
defines  the  energy  function  for  p  training  exemplars  as 

e.-.  ■  z;.,  z:.,  5  ('!('.))“ 


The  instantaneous  error  method  seeks  to  minimize  the  error  associated  with  a  single  training 
exemplar.  In  this  case  the  energy  function  is  defined  as  the  sum  of  the  squared  error  function. 

^  -  n.,  ^  (^*(0)' 
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In  both  cases,  the  weight  update  algorithm  uses  a  gradient  descent  down  the  error  surface  to  converge 
toward  a  minimum. 

C.2  Ensemble  Training  Technique 

For  the  purposes  of  this  thesis,  the  recurrent  backpropagation  learning  algorithm  will  be 
employed,  with  a  slight  twist.  The  weight  updates  will  not  occur  at  every  Hopfield  iteration,  but  will 
be  done  after  the  network  converges  to  a  solution  for  a  given  input  training  exemplar.  This  is  done 
because,  changing  the  weights  during  Hopfield  iterations,  interrupts  the  recursion  process  and  may 
result  in  nonconvergence  of  the  neural  network.  This  differs  from  the  conventional  recurrent 
backpropagation  network  training  algorithm,  which  trains  feedback  and  input  weights  after  each 
iteration.  A  recurrent  backpropagation  network  adopts  this  type  of  training  because  what  is  important 
is  the  temporal  relationship  between  iterations,  rather  than  the  output  upon  convergence. 

The  Hopfield  neural  network  is  concerned  with  the  output  value  at  the  time  of  convergence. 
For  this  application  the  neural  net  must  be  allowed  to  converge  to  a  solution,  whether  right  or  wrong. 
The  difference  between  this  solution  and  the  desired  solution  deflnes  the  instantaneous  error.  The 
instantaneous  error  for  that  exemplar  is  made  up  of  ar  '  ibutions  of  error  for  each  Hopfield  iteration. 
The  energy  function  analysis  must  then  be  propae  <ck  through  each  iteration  and  the  average 
weight  update  will  be  used  to  determine  the  new  v  The  weight  update  for  each  exemplar,  will 

be  made  prior  to  presentation  of  another  training  exemplar.  This  backpropagation  of  weight  updates 
is  represented  by  a  total  weight  update  equal  to  the  mean  of  the  weight  updates  that  would  have  been 
done  for  each  iteration.  This  thesis  assumes  that  the  neural  network  operational  characteristics  can 
be  modeled  by  a  discrete  time,  synchronously  updated  system.  If  the  process  begins  at  time  t,  and 
ends  at  time  t®i,=to+Q,  the  change  in  weights  per  training  exemplar  is  defined  by 
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At  this  point,  it  is  important  to  remember  that  the  weight  update  occurs  after  the  network  has 
converged  (t=tj,  and  not  at  each  iteration.  This  type  of  weight  update  scheme  is  represented  by 

+  AH^ 

For  each  Hopfield  iteration,  this  system  does  not  update  the  weights,  but  must  keep  track  what  the 
weight  updates  would  have  been.  Each  iteration  calculates  the  present  change  in  weights  by  using 
information  from  previous  weight  updates.  This  can  be  termed  a  "short  term  system  memory",  since 
knowledge  of  all  training  iterations  is  needed  to  calculate  a  weight  update  per  training  exemplar. 

By  examining  the  process  in  a  sequential  time  sense,  when  a  new  exemplar  is  presented,  the 
weight  update  for  the  past  training  session  represents  the  point  of  origin  for  the  weights  for  the  new 
training  exemplar.  Each  training  exemplar  defines  a  new  energy  function  in  energyAveight  space.  The 
gradient  descent  of  the  new  energy  function  begins  at  the  location  of  the  last  weights.  Therefore,  the 
calculation  of  the  new  weight  update  for  the  present  training  exemplar  is  a  gradient  descent  to  a 
minimum  in  reference  to  the  last  minimum.  This  sequential  process  of  updating  the  weights  for  many 
exemplars  can  now  be  termed  the  "systems  long  term  memory",  since  the  sequential  referencing  of  the 
present  to  the  past  weight  update  takes  into  account  the  effects  of  the  weight  updates  for  all  the  past 
training  exemplars.  A  large  ensemble  of  training  exemplars  will  be  presented  to  the  neural  network. 
The  presentation  of  a  large  set  of  exemplars  is  attempting  to  build  a  global  energy  surface  for  the 
random  process.  By  utilizing  the  long  term  memory  of  the  system,  it  is  hoped  that  generalization 
about  the  adaptive  optics  system  statistics  (global  energy  function)  will  occur.  This  generalization 
should  result  in  the  formation  of  an  optimal  set  of  weights  for  the  wavefronts  random  process  that 
correspond  to  the  location  of  the  global  energy  minimum  in  weight  space.  To  determine  the  ensemble 
training  algorithm,  the  iterative  qualities  of  that  ensemble  training  algorithm  must  be  defined. 

C.ll  Iterative  Aspects  of  the  Ensemble  Training  Algorithm.  To  analyze  the  time  dependent 
aspects  of  the  Hopfield  neural  network  we  must  first  define  the  temporal  transition  characteristics  of 
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the  Hopfield  neural  network.  To  accomplish  this,  an  analogous  network  to  Fig  (Cl)  is  shown  in  Fig 
(C2). 


Figure  C.2  Discrete  Time  Representation  of  the  Hopfield  Artificial  Neural  Network 

Where  the  subscript  Q  represents  the  time  at  which  convergence  occurs.  The  development  of  the 
instantaneous  backpropagation  learning  begins  with  the  definition  of  the  weight  update  equation  at 
any  time  t,  for  the  system  defined  in  Fig  (C.2). 

w^(r>l)  «  w  (0  ♦  A  w.  (r)  (CS) 

Using  this  notation,  at  a  time  t,  w^  is  defined  as  the  weight  from  the  i*^  input  feature  to  the  neuron. 
Where  Wjj(t)  represents  the  new  weight,  w^(t-l)  is  the  old  weight,  and  the  change  in  weights  is  denoted 
by  nwij(t).  The  weight  update  at  time  t  is  defined  as 
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The  negative  of  the  gradient  points  in  the  direction  of  greatest  decrease  or  descent  on  the  energy 
surface.  Thus  the  gradient  vector  provides  direction  along  the  surface  of  the  energy  function  which 
searches  for  a  minimum.  The  factor  »;  is  the  gain  or  the  learning  rate  of  the  training  algorithm. 
Applying  the  chain  rule  to  the  gradient  of  the  energy  function  with  respect  to  the  activation  results 
in 

dE7  _  dEF  ^^k(0  (C.7) 
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For  the  adaptive  optical  imaging  system  implemented  using  a  Hopfield  neural  network,  the  actuator 
command  k“’  neuron  is  defined  as 

■  A I K.,  c.  (,  - 1 ) .  e;.,  (0  Cl  (c-*» 

This  simply  states  that  the  output  of  the  k"*  ncurr/P.  at  any  time  t  is  described  by  a  nonl  ncarity  whose 
argument  is  the  sum  of  two  terms.  The  first  term  is  a  linear  combination  of  the  weighted  actuator 
outputs  one  time  step  earlier.  The  weights  are  defined  by  the  feedback  control  matrix.  For  siabi'ity 
purposes,  the  main  diagonal  of  the  feedback  interconnection  matrix  is  set  to  zero  and  off  diagonal 
terms  are  symmetric,  i.e.  T„^  =  T^^.  The  second  term  is  the  weighted  sum  of  the  m  input  features, 
this  is  the  set  of  weights  that  are  to  be  trained.  Examining  the  partial  derivative  of  the  output  of  the 
k"'  neuron  with  respect  to  ihe  input  weights  results  in 

The  detwative  looks  at  the  change  in  the  activation  of  the  actuator  command  at  time  t  with  respect 
to  the  weights  at  time  t.  Therefore  the  first  term  is  independent  of  the  change  in  weights  at  time  t. 
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and  equals  zero.  Examination  of  the  second  term  results  in  a  derivative  that  exists  only  when  m=i, 
and  k=j.  The  change  in  the  activation  of  the  actuator  command  with  respect  to  the  weights  at  time 
t  can  now  be  represented  by 
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Referring  back  to  the  definition  of  the  instantaneous  energy  (Eq  C.2),  yields 

dB  _  dB  ^  J  (C.ll) 

dw.(t)  '  dc^it)  ‘ 


Now  we  must  define  the  change  in  the  energy  of  the  network  with  respect  to  the  actuator  co, 
at  time  t.  The  at  any  time  (t+1),  energy  function  can  be  described  as  being  a  function  of  the 
individual  actuator  commands  at  the  same  time. 

B  =  £»’[c,(f  +  l).c,(r-^l) . c^(f+l)]  (C.I2) 

We  know  from  Eqn  3.22,  the  definition  of  the  i"*  actuator  command  at  time  (t  +  1)  is  a  function  of  he 
actuator  commands  one  time  step  earlier,  therefore  the  actuator  command  at  time  (t  +  1)  can  be 
defined  as 


Ci(f*l)  =  c^(c^(f)) 
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Observe, 
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Applying  the  chain  rule  to  Eq  (C.12)  results  in 
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The  last  partial  derivative  results  in  the  feedback  weight  for  the  ki'*’  neuron.  Eqn  3.50  can  now  be 


written  as 
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The  change  in  weights  for  each  iteration  is  now  defined  using  Eqs  (C.6,  C.ll,  C.16)  by 
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If  we  look  at  the  output  when  the  neural  network  converges,  the  error  function  is  described  by 

The  gradient  of  the  energy  function  with  respect  to  the  input  interconnection  weights  can  be  related 
to  the  instantaneous  error  by 
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(C.19) 


ap 


dc^iO  2 


Since  the  desired  outputs  remain  fixed  during  convergence,  the  change  in  the  desired  output  with 
respea  to  the  weights  is  zero.  The  derivative  is  only  defined  when  k^h,  and  therefore 


ap 


(C.20) 


Now  to  the  change  in  weights  per  iteration,  we  must  begin  after  the  network  has  converged  and 
propagates  backwards  to  t^ 

To  summarize  this  procedure,  the  steps  to  train  this  type  of  recunent  network  are  as  follows: 

(1)  Initialize  weights,  for  this  case  the  weights  are  initialized  to  the  least  square  estimator 

weight  values  to  reduce  training  time. 

(2)  Present  an  input  training  exemplar  and  let  network  converge 

(3)  Using  the  number  of  iterations  and  Eqs  C.19  and  C17,  propagate  backwards  calculating 

the  change  in  weights  per  iteration. 

(4)  Using  Eqs  C.S  and  C3,  update  weights. 

(5)  Present  another  training  exemplar  and  proceed  to  Step  3 
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